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SEARCH £NGD(£ detennined accoiding to the programmed cTileria of the 

search eogtne^ and the emphasis placed on particular types 

This q>plication is related to US. Application No. Ser. design, rather than any measure of the actual useis' 

60/078499 entitled "Improved Search Engine'' that was opinions. Indeed this can lead to the absurd situation 

filed on Mar. 16, 1998. 5 whereby in an attempt to ensure a favorable rating by the 

most oommonly used search engines, some designers delib- 

FIELX) OF THE INVENTION erately configure their sites in the light of the previously 

The present im^entioo relates to a method and apparatus "^""^^^ ^ detriment of the prescntetkm, 

»i. * n r I- I J . u ^ J readabihty and content of the site. 

that auows for enhanced database searching, and more 

particularly; for use as an internet search engine. SUMMARY OF THE INVENTION 

BACKGROUND OF THE RELAIED ART J^^^ °f. P"^"^ invenUon to ameUorate the 

aforementioned disadvantages of conventional search 

An eflScient and practical means of obtaining relevant engines by harnessing the cerebral power of the human 

information and also screening uowanted/uninteresling operator. 

infonnation has been an ongoing need, especially since the It b a further c^jed of the present invention to prvyvide a 

inception of the internet This need is particularly acute at novel search engine with enhanced e£Scieiicy, usability and 

present due to the exponential growth in the iramber of effectiveness with a reduced system storage and/or conopu- 

woild-wide web sites and the sheer vohmae of information tational requirements in companstm to existuig software 

contained therein. In an attempt to index the information ^ engines^ 

avaflable on the internet, a number of software search It is a fiuther object of the present invention to provide a 

engines have been created via which a user enters a search variety of indications oi the popularity of the search data, 

command comprised of suiuble keywords from a keyboard together with an indication of its date of creation or imdat- 

at bis personal computer. The search amuhand is transmit- jpQg^ 

ted to a server computer, tbehas a search engine associated ^ in order to obtain the above recited advantages of the 

with the server conyuter. Tbe search eiigine receives the invention, among others, one embodiinent of the 

j^coomand,andtbenusmgits^ „j i^y^^^ p^dcs for a method of updatipg an 

through a datab^ of web addresses and the text stored on ^^^^^ d^j3j^ results of a^IL's 

^ of the s^ are trans. ^^^^ ^ lists from the general web 

Emitted fiomtteservercomputert^ckto ^ 3^ ^ ^ to the user as a result of his initial 

and displayed on the screen of the user's computer. j,^^ By updating the daUbase with the 

In order for the search engine to be aware ofnew web sites selections of many different users, the database can be 
and to iqKhte its records of existing sites, either the prppri- updated to {^oritize those web listings that have been 
ctoisofthe web sites notify the seardi engine themselves or selected the most with xcspcd to a given keyword, and 
the inforaaation may be obtained via a 'web crawler' to 35 hereby presenting first the most popular web page listings in 
update the daUbase at the server computet A web crawler is a subsequent search using the same keyword search entry 
an automated program which e]q>lores and records the In another embodiment of the present invention, a method 
contents <rf a web site and its inks to other sites, thereby of determining content to provide along with listings trans- 
uding between sites m an attempt to mdex aU the current ^^d from a server computer to user sites is provided. In 

40 this embodiment, there is obtained a content listing from 

This database structure and method of seardui^ it poses each one of a phiraH^ of different developer ates. Eadi of 

some significant difficulties The internet growth-rate has the content listing includes content, a developer identifier, 

resulted in a substantial backlog in the scanning of newsites, and a keyword, and a keyword selection factor. Thereafter' 

notwithstanding the fact that web sites are frequently there is detennined a particular keyword from the obtained 

deleted, re-addressed, updated and so forth thus leaving the 45 keywords that b the same for different content listings For 

search engine with outdated and/or misleadiiig informatiop. that partkidar keyword, the keyword selection factor is used 

Although the web crawlers can be configured to prioritize in determining when to transmit different content listing to 

possible key-words according to their location (title, embed- the user sites. 

ded link, address etc), nevertheless, depending on the type of in still another embodiment, there is provided a method of 

search engine used, substantial portions of the web site text 50 updating a keyword table with the results of a user's 

(often invohring the majority or even aU of the site text) is selection of ^cific keywords which were obtained from a 

still required to be scanned, this results in colossal storage list of related keywords presented to the user. By updating 

requirements for the search engine. Furthermore, a typical the database with selections of many different users assod- 

key woid search may bring up an excessively large volume ated with I same keyword, appropriate teywords can be 

ofmatcrial, the majority ofwhich may be of mUe interest to 55 provided and presented first when that same keyword is 

the user. The user typically makes a selection from the list subsequently entered, 
based cHi the brief descriptions of the site and explores the 

cbosea sites UDlil the dejaied irfonnation is located. B^EF DESCRIPTION OF TOE DRAWINGS 

These results are in the form of a list, ranked according to These and other advantages of the present invention may 

criteria specific to the search engine. These criteria may 60 be appreciated from studying the following detailed descrq>- 

. range from the number of occurrences of the key- words tion of the preferred embodiment together with the drawings 

anywhere within the searched text, to methods giving a *° which: 

weighting to key- words used in particular positions (as FIG. 1 illustrates certain of the overall features of the 

previously mentioned). When multiple key-words have been present invention; 

used, sites are also ranked according to the number of 65 RG. 2 illustrates various inputs to the search, and, for 

different key- words applicable. A fundamental drawback of each often different capabilities, illustrates the outputs that 

all these ranking systems ts their objectivity — they are will be provided engine according to the present invention; 
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3 4 

FIGS. 3A aod 3B illustrates an overview of the process by typically, a microprocessor, memory and modem, a certain 

yiiicb web pages an; selected in making up the search results of die computers contain delays and the like, as arc well 

provided to the end user accord to the present invention; known. As shown in FIG. IB, a plurality of user sites/ 

FIG. 4 illustrates the data sets used for different web-page computeis lOOA-lOOD are shown, as are a plurality of 
seairchcs according to the present inventioD. S server computeis 1Q2A-^, and developer sites/computers 

FIG. 5 ^ows c various data sets previously describe and IWA-B. It is understood that in a typical internet network, 

various inputs and actions that reailt in a list of suggested different serva^ computers 102 can be interconnected 

web pages being provided according to the present inven- together, as is iQustrated. Further, while only a few user 

tion; developer sites and server computers are ^xywn, it is 

nc. 6 flhistratcs the implementation of a popular search "^T^^^^ thousands of such computeis are intercon- 
according to the present invention: 

FIG. 7 iDustrates the implementation of a hot off the press ^ ^ embodiments of tee present invention 

search according to the present invention: ^ apphcatoons in which the mvention is impb- 

cir* ttni.^ « 4k - 1 * r t.- - mentedassequencesofcodcdprogrammstiucticHisoperated 

na 8 iUustrates the implementation of a high-flyers 15 „pon by a server computer 102 as ilhistrated, it ^ISl be 

search acoordmg to the present mvention: understood that certain sequences of these program instruc 

FIG. 9 ilhistrates the implementation of a random search tions could instead be implemented in other foims^ such as 

according to the present invention: processois havmg specific instmcdons specifically tailored 

FIG. 10 iDustrates the implementation of a previous past ^ applications described hereinafter, 
favorites search according to the present invention. ^ As will be iUustrated hereinafter, additional operations, 

FIG. 11 illustrates the implementation of a collective transparent to the usei; are implemented in order to obtain 

search according to the present invention. seardi results in the future based upon currently made 

FIG. 12 iUustrates the implementation of a date created searches. As shown, the present invention has various 

search according to the present inventioiL 25 of which are ilhistrated in a paraUel flow 

FIG, 13 iUustrates the implementation of a customized Alustratcs an overview of the di£ferent 

search according to the present invention. c^^^abflitjcs can be ongoing sanultaneously. In teims of 

na 14 iUustrates the implementation searching based ^'l„T?f^ T ^t^7 ^^^^^^ 

upon a group identity acco^ to the present invention. ^^u. "^^^ ^ 

HG. 15 iUustrates a keyword eHnunator feature according ^ i„ ft,, „k ^ t-* . i_ i -* j i. 

to tfic present invention. In wder for web pages 12 to be selected by a user 

cf^ n«^« r J . . ^ ^. ^ aoxmiing to the present invention, there is a Step 18 in which 

FIG. 16 iUu^^ the process of determimng which the type of search to be performed is selected. Thereafter,^ 

^ "^1^ """^ up the cumulative step 20, search input obtained fiom one of a variety of 

surfer trace table aoconhng to die present inventwn. ^ sources is input and used abng widi the algorithm seh^ 

HG. 17 mstates active suggestion of web pages according in stq) 18 to determine search results. The results of this 

to the present invention. search are then di^layed to the user, as shown by steps of 

FIG. 18 iUustrates passive suggestion of web pages displaying a aeated list of web pages, delaying passively 

according to the present invention. suggested web pages, and dis|^aying actively suggested web 

FIG. 19 provides an overview of suggesting keywofds 40 pag^ identified as steps 22, 24, and 26, respectively, in FIG. 

according to the present invention. 1- This capability, and how it is implemented, wiU be 

HG. 20 iUustrates the manner of creating data sets for <*cscaribed in more detaU hereinafter, 

suggested keywords according to the present invention. 1° order for keyword suggestion to take place, which the 

FIG. 21 iUustrates a variety of manneis in which a list of may or may not select, there is preferably an initial step 

suggested keywords can be created according to the present 45 28 in which the type of keyword search olgorithm to use is 

invention. selected. Although many systems may have only one such 

RG. 22 iUustrates how content is attached to web page various ones, as described hereinafter, are pos- 

Ustings according to the present invention keyword search olgoritfun is selected, step 

nC. 23 iUustrates various content daU sets and opera- ^^^ll?'^ ^ "P^f / ^'T''''^ ^^"^ * 

tions that populate them according to the present invenLi. ^ ?^ ^^"i!^^^* 1 "^"Hf ""^""^ ^ 

CTi^ • . • ^ y Mu^uuiju, detenmne associated keywords. The results of this operation 

HG 24 iUustrates vanous^^^^^^ are then displayed to the user in 30. This capabflity,ial how 

tions that are used to select data from them a acoordmg to the it is implemented, wiU be described in mo«^ deSa herein- 
present invention. 

"^"^ The previously mentioned web pag, and keyword selec- 

daU accordmg tq the present mvention. capabibdes inured to the diiect ^nefil of the end user. 

DETAILED DESCRIPTION OF THE Another novel feature of the present invention, which indi- 

PREFERRED EMBODUVfENT rectly inures to the benefit of the end user, directly benefits 

T^#-o * J* - the advertiser, because it allows for content to be targeted in 

FIGS. lA and IB dlustrate certain of the overaU features m real time based tspon various criteria. As wiU be described 

of the present mvention, whidi wiU be described in fiirther more fiiUy hereinafter, a content providing olgorithm is 

detail heremafier. It is miliaUy noted that like numbered initiaUy selected which wiU determine how content is 

reference numerals m vanous Figures and descriptions wiU selected in step 34. Step 36 foUows, and based upon inputs 

be used in the followmg descnptions to refer to the same or from users and content providers, which content to show is 

similar structure, actions or process steps. 65 determined. Thereafter, the advertisements are displayed for 

The present invention is preferably implemented in a the user to sec, simultaneously with the di^lay of cither 

network eovirooment wherein each computer contains, keywords aiKl/or web pages 
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While FIG. 1 illustrates certain overall features according appropriate^ the structure of the dependent data-sets used to 

to the present invention, many of the advantageous features create the defined data-set are shown in tabular form) with 

of the present invention are not, as mentioned previoudy, reference to the preferred embodiment of the pnsciA ioven- 

observable to the user, but iiistead tran^arent to user. Hiey tion. Thereafierj certain of these will be explained in even 

are, however, significant in order to fully explain 5 greater detail to fiilly teach how to make and use the present 

present invention is implemented and are explained herein- invention. 

Locatiois: a plurality of unique informatk>n entities. 

FIG. 2 is provided to illustrate various inputs to the search Web-pages: Locations in the form of Web^agjes URL 

engine according to the present invention, and, for different Universal Reference Locator) adcfresscs. 

capabiHties, iUustratcs the outputs that wiU be provided, lo Key-word: The wofd or phrase that is entered in the 

More detailed explanations arc provkled hereinafter. Data search engine 

that is potentiaMy input ftom search engine user inchidc: ^^^^ ^ web^pages URL addresses) that is the 

keyword 52— this is the word or phrase that the user cater result of the key-word search. Ibis hit-Kst ranks the rel- 

to find a list of web pages cvance of the web-pagps relative to the key-word. Ibis 

profile types 54-^ese are the groups of people they bit-list always has a key-woid associated with it. 

associate them selves with e.g. US, male, doctor etc 
User ID 56— this is a unique identification for each user 

that chooses to register with the search engine. This can 

be done via a cookie or logon. 20 
search type 58 — this can be actively diosen by the 

searcher to de:termine the type of search results they 

would like (popular, new, etc) 

date-time 60 — this is passively recorded when a seardber 

uses the system 25 Permanent data set: Retained k>ng tern (although it changes 

IP address 62— this is pasavety recorded when a seardier overtime) 

uses the system Temporary data set: Created only for the duration of the 

other 64 — this includes other personaHzatioii idformatu>n ^ scardi ^ ^ 

such assearcfa customization preferences, keywords for ^ ^ * measure of bow users scardt It is a 

web page suggestion etc This information is entered ^ of the key words they search for, the URLssubse- 

actively once by the user then used to personalize the quently selected and how kmg they ^>eDd there, fiom 

search results each time the users (identified by iiser wfaidi a ranking of web^ages for a users (surfers) can be 

ID) uses the search engine. calculated. It is a measure of which web^ages they foimd 

Data from web-page develq;»ers include: most usefiil after the key-word search. The combination 

URL 66T-this is the URL address of die web pagp or of all surfer traces is used to create a users' choice hit-list, 
pages that they wish to submit 

description 68 — this is a 2-3 line description of the 

information on their web-page LpotdBbfct Outp^^sa 

keywords 70-^-thcsc are the keywords that the web page 40 



Input data fct 


Ootpul data set 




Hit-list - RaaikBd hil'^ist of 


Database to iiiBlch the kcy-woid with 


Wrf>-pages (ics^poKaiy) 


(pcfiiuuient} 





devcfeper^uld like to associate tbdrwcb-page with S^TSSte^Lw ^^^Jt'Sl^lSw f„ 

^ user seleciiotts nom mmal seaicn weo-pages Qseis uxaa nsenu io€ 

target audiCTce 72— these are the target audience dvofile i»iUta O«npo»iy), ie. w* pages eachkEy-woid(canbepttnia2ieDtor 

types 54) that the web page developer particularly want visited (urls) tempomy) 

to target Times ^wnt a each URL 

' . IP address of user 

date-time 74— 4his is passively recorded when every a - - 

web-page developer submits a web page 

Data from content providers inchide: Users' choice hit-list* Tbis a semi-permanent rankiri^ of 
bids 76-these are $ bids for content as described hter, web-pages associated with every key-word and indicates 

« « ^ . 1 .I.- - i„j « J ^ M ^ . . bow usefiil Internet users found each of the web-pases 
content details 7ft— 4his mchides all details of content • ••u *». 1 _j * ^ • 1^.1- . 

. , • ^ J J . J * -1 . 50 associated with the key — word The users dioice hit-list 

providers including address, content details etc. . . ^ „ j / j l ^ . 

R«4iills &oa. olbeTWhengiDes 80-these are Ibe incrementJly updated by a new surfer tiace. 

results for a keyword search fiom other existing seardi 

engines. ^ 

Outputs of the seardi engine 10 are: iiput ^ ^tt Oatpat dau set 

lists of web pages 90— defending on the input data a list ~ ' — ' ;; — , 

r^f r^^u .w.^..*^ :« Soifer tiace (caa be pcnnanent or New Usew' choice hit-list - Ranked 

Of web pages can be produced m web page deteraii- tempoiaiy) hit-list of >)puiar w*i»g)Bs , 

nation step 82, described fiirther herea^r; usecs* choice hit-liat (pcnnaneiit)* (pecnuuieDt) 



The initial osera* dioioe hit-tist will be the surfer trace. 



content keywords 92 — the seardi engine suggests other 

keywonk for users to try produced in key word deter- ^ 

mination step 84, described further hereinafter; and New web-page list This is a list of new web-pages that is 

content 94-Tthe search engine sends out selected content created by URL submissioDS from web-page cfevelopeis. 

as produced in determine content step 86, described When a web developer updates a wet>-page» they can 

further hereinafter submit the web-page address, brief information atxmt the 

To facilitate ease of reference and aid understandii^, the 65 page and a list of key- words that the developer decides are 

aforementioned and subsequently mentioned dala-set defi- relevant. The web-page is then placed on the top of each 

nitions are reiterated and expanded upon below (and where of the key-word new web-page lists. 
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Iii(rat idatB set 



Ootpat d&ta set 



AO wdi-pagc dcvclopcn infonnadon New wcb-pagc list (pcnnaneat) 
about addiess and. kcy-wdnte 
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of key-words rather than \^^>-page The method for doing 
this uses a similar olgorithm to a speH-checfcer only it 
docs it for phrasesw It also suggest Key-words^ based oo 
previous URL selections from sequences of user key- 
words. 



Content Provider's list: This is a list (associated with each 
k^y^'wotd) of content providers which must typically 
[that] pay to illustrate content with the key-word. The lo 
price paid is dq>endeDt on the number of other content 
providers, the amount they spend and the number of times 
the key word is searched for. 



Input data set 



Output data set 



Key-WDid (bmpomy) 
Datetiasc of all key-wuids used 
(pcnnascot^ 



Ranked hit-tut of other key-words the 
user may want to try (tenqxnaiy) 
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hip«< data set 



Output data set 



Key-wofd 

OootcDt Pnnrider's bids for ooDtenl 
ipots 



Content PiDvtden Ust - a list of 
cootcot associated with each 
key-wDrd penmuieDt) 



Surfer keyword list: This is a data set comprised a list of 
key-words that the iridividual user found useful afier the 
key-word was selected. Hiis is found by traddng which 
key-words the user decided to use. This is equivaloat to 
the suffer trace. 
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High-flyers hit-list: This a list of wcb-pagcs (associated with 
every keyword) that are increasing in pcpularity at the 
highest rate. It is an indication of how rapidly web-pag^ 
are rising iq> the users* choice hit-list and it is used as a 25 
means to ensure that new emerging web^ages rise to the 
top of the users' choice hit-IisL 



Iqmt data set 



Ootpot data set 



Key^woid (tempoiaiy) 

Data about what key woids w 

Ciom the key-woid snggcster 



Ranked list of other ky-woidt 
; wed (associated with the kcy-woxd) that 
this individDa] user found 
(venn-pennanent) 



Ibput data set 


<Xitpat data set 


Old Usen' choke hit-lat- 


Ifigh-a>cis hit-fist: A lankBd tnt of 


(ftxapomji) 


wcl^^iages that are nsto^ tn populaifty 


New Uaen' cfaoaoe hit-list - 


the fastest 


(jtfjgiMient) 
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key-word suggesten This is a daU set consisting of a 
permanent ranking of other key-words that users have 
found useful^ ooiiq»iled from snooessive surfer key-word 
lists and is linked to each key-word (this is equivalent of 
the users' dioice hit-list). 



3S 



Input data set 



Output data set 



Personal hit-list: This a list of web-pages the individual user 
has found most useful for each key-word search they have 
done in the past It is like an automatic book-marking data 
set for each individual user. 



&u&r key^word Hit (ten^ or 
pcnnanent^ 

Existtng oscn' dunoe hat Ust 
^>cniianent)* 



New usen' chcaoe key-woid iist 
^vcnnancnt) 



Ii^iut data set 


Output data set 


KeynvDid 


Ftenonal hit-list: A xankcd fist of 


Individual surfer trace - ^imancnt) 


wdhpages that an individual has 




fiMud useful in the past 
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Collective Seardi hit-li^. This can be a combination of any 
of the above hit-lists. There are many different ways a 
these hit-lists can be combined. 



Iiqrat data set 



Output data set 



Oawler hit-Ust (temponiy) 
Users' <dx>toe hit-tist (pennanent) 
Advertisen* fist (pennaneitf) 
New web-page fist (pennanent) 
High-ftyeis list (permanent) 
Personal hit-list (persBoeot) 



Collective Search hit-lists - (Defimh) . 
Ranked hit-Ust of Wcb-pages 
(fispbyed to the user after the key- . 
wofd search. It can be a combinatiott 
of any of the hit-fists above 
(temporary) 



Crawler key-word list: This is a list of key-word sugg^^stions 
- mat the user may find useful. This is found by matching 
the key-word entered by the user to the database of 
key-words and phrases that other users have tied. This is 
the equivalent of the crawler hit-list, though it is a ranking 



User Based Search Algorithm 

The discussbn iffovided above provides the language 
necessary to more folly describe the present invention As 
illustrated in FIGS. 3A and 3B, widdb provide an overview 
of the search engine capabilities acoofding to the present 
invention in \tduch web pages are selected in making up the 
search results provided to the end user. In step 112, the user 
enters up to 4 sets of data; keyword 52, profile type 54, 
search type 58 and User ID 56. The IP address 62 and 
50 date-time 60 are not entered by the user but can be read whra 
a user uses the search e . This data is used is used in parallel 
m steps 114 and 116 to produce Est web pages. Step 114, 
discussed in detail hereinafter, is the process of selecting 
web pagies fiom novel new search engine data sets produced 
55 in accordance with the present invention. This can run, if 
desired, in parallel with step 116 which obtains a selection 
of web pages from other existing search engines. Hiereafler, 
selection of web pages from step 114 and 116 are combined 
and tagged in step 118. llie process of tagging the list of web 
60 pages, describe in more detail below, enables a set data, 
shown as surer trace data in FIG. 3, to be created and sent 
back to the search engine when the search engine user 
selects awe-page from the list in step 120. The process of 
selecting a tagged web-page creates the following series of 
65 data which is used to update the search engine data sets; 
keyword 124, URL 126, user ID 128, IP address 130, 
dale^time 132, brief web page description 134. 
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Although it is prefened to use all of these different data embodiment, each applet contains all of the infonnation 

types in the suifer the daU, use of different combinations of necessaiy to utpdalc the database at the search engine, 

this data is filly within the intended scope of the present Another embodiment collects the suffer trace data prior to a 

invention The description 134 wiU typically only be user navigating to the mtended web site. Other ways of 
induded in the preferred embodimait of the invention when 5 obtaining this surfer trace data are possible and are within 

anew site is added to the data set 114 of the search engine intended sct^ of the present invention. 

10, and the description used will be that descrq>tion that . Thas, the search results paige according to the present 

appears on the originallist of web pages. Ibe date-time data invention is therefore differently formatted from conven- 

132 may only indicate that a site was selected, rather tm scmAi engines' rcsuhs pages. TTie difference is in 
record the period of time a user was at a partidilar site, as lO a<^»on ^Iber than content A^affy, the page kx&s the same 

ciq>lained further hereinafter. This process is invisible to the ^ the user as standard seardi results from other search 

user who, upon selecting the web-page fiom the Kst of web ^""^ ^ ^"^^ In a conventional 

pages is taken difxxUy to the corroding ^ * of Uie keyword 

ri^ t f.u - 1 . cZ^\7A%^^^f7£i^l ^"feather" may read: 1. www.wcatber.oom Today's weather 

""^LT^^^^ 118 and 120 will Today is expected to be fine ad somiy ^mywbere 

A^'^^.'^.T"'.^^"'"^'' ^ ITie imp link as^ed with the -W.Watte^ 

After the mitial selection the user may choose to access ial>cl is "http: Vwww.weatberxom". This oieans that if the 

another of the web-pagp URLsearch results. Depending on user selects this link, they wiU navigate to this page diiccay 

the relevance of the site, the nserinay ^>end time rcadir^ In contrast, acccMding to the present invention, the tagged 

downloading, entering further pages^ embedded links arid result page for the search made suing the keyword 
so forth, or if the site q>pears irrekvantAminterestii^ the 20 "^A^atber^ may read 1. www.wcatber com Today's weather 

user may return directly bad^ to the search results after a forecast Today is expected to be fine and sunny everywhere, 

short period The time difference Ixtween the two selectioiis The HTTP link asspdated with the 'Vww.weath^xom" 

is recorded as the differeiice between two date/tinie data 132 labd is linkaspTn-l." If the user selects this Hnk, therefore, 

from subsequent selections from the list of web page in a process is invisibfe to the user, the user is first directed 
searches (in this embodinoent one can ority measure the tinie 25 to the Hnk a^ page on the site oonespooding to the web 

^nt at one web page if another selection is made after server using the search engine 10 according to the present 

visiting that web page — this then provides another surfer invention, and pass parameter n with value 1. 

trace 132 which allow a time difference to be calculated). Server &dc code (plication code that runs on the web 

This surfer trace data on the popularity of web pages is used server) uses this parameter to identify Me URL and descnp- 
to the subsequent searches, as described further hereinafter. 30 tion of the user's chosen ate. This inforriiation is then stpied 

Thus, according to the present invention, it is the human in a database Table alor^ wit other surfer trace data. The 

users' powers of reasoning and analyas that is beg used to server side code then executes a redirect operation to the 

establish the relevance of the different results to the subject user's required URL. Hie user then sees their required page 

matter of the search. Hie present inventbn utilizes tie appear. 

cumuktive processing and reasoning of an the faumaii users' 35 The source ofsean^ results is independent to this activ^. 

to provide a vastly more effective meais of obtainiiig the The destination page of the user is independent of this 

required information sources than is presently possible with activity. The process is one of recording a user keyword and 

the type of method described above destination into a database. This method of trackiiig can only 

As described above, human brain power is captured by record the initial web-page visited after a keyword seardi. If 

recording which web pages the user goes to after each 40 the user continues to return to the search results list then 

keyword search. Acoordipg to the preserit invention, collect- subsequent web-page viats can be recorded, 

ing the suifor trace daU is achieved by sending, in the list of The surfer trace daU that is sent bade to the date sets 114 

web pages generated by the search to the user; hidden links of the search engine 10 as a result of the user selecting the 

that will automatically send ihfi[Hmation back to the search we-page can be encrypted to prevent fraudulmt users from 

; engine (or a subsidiary server). While the user only sees that 45 sending fake data to the search engine, 

his intended link is di^layed, the hidden link notifies the Another method of tracking ^^lere a user may connect to 

search engine of d)e transfer, which process can be executed from an initial URL selection (if they do not return to the 

with a Java applet Thus, when the Internet user selects a search r^ult page) is to run the selected web^ages as part 

web-page it takes the user to that address but also sends off of a 'frame' located at the search engine web-site. This 

the surfer trace data to the search engine 10, wtncfa notes SO permits a complete record of the web pages v^ited to be 

what has been selected When the user returns to the list of recorded after a keyword is entered. However, this imposes 

web pages and selects another web page listing, another Java an additional level of complexity to the system with a 

aj^let is then executed which creates another surfer trace. possible decrease in system re^xHise time. 

Hie difference between the data time data in this surfer trace As prevK>usly mentioned, the surfer trace data that can be 

from two sequential selections captures the time period that 55 collected indudes keyword 124, URL 126, user ID }2S^ IP 

the user bas been at the previous wd> site. This occurs address 130, date-time 132, brief web page descr^tion 134, 

without the use knowing this data is being sent and is identified as such since it provides a trace or record 

In another embodiment, rather than using multiple Java of how searchers (surfers) use the seardi engine. This data 

applets to collect a complete list of surfer trace data, there is is used to improve future searches building on the piefer- 

no description data 134, and the date-time data 132 indicates 60 ences of i^evious searchers. The surfer the is thus a measure 

that a user visited a particular web site. In one specific of the preferred choices of an individual user or web 

embodiment, the user must visit a particular web site for 'surfers' from the initial search results for a particular set of 

greater than a predetermined period of time, such as one key-words. 

minute or fifteen minutes, depending on what is an appro- How the Data Sets are Created that Determine the List of 

priate time to have looked at the site for the visit to the site 65 Web Pages 

to count and for any surfer trace daU to be sent back to the FIG. 4 illustrates the data sets used for different web-page 

search engine 10, as well be described hereinafter. In this searches according to the present invention. The data sets 
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(tables) that are used to determine the list if web pages 
include keyword table 164, profile I£> table 166, security 
table 168, cDuuulative surfer trace table 170, keyword URL 
link table 172, personal link table 174, and web-page (URL) 
table 188. 5 

The structure of the aforcmeationed data sets are 
described in more detail bereinafler. The descriptioiis that 
folbw show the data arranged in a spreadsheet ^shion, with 
multiple values per cell and many blank ocUs. Illustration in 
this manner is conveoieint for explaining the present 
inventiori, but is not an efficient storage and retrieval 
method. As will be apparent to those skilled in the art, a 
relational database model would be used to implement the 
data storage according to the, present invention such that is 
there may be multiple fields or Tables involved to store the 
data and each fiekl wiD store only one vahie. 
Keyvrard l^ble (164) 

The contents of keyword data table 164 of FIG* 4 are 
:^wn in more detail in Table 1 shown below, and is a list 
of keywords inchidirig phrases, and the number of times they 
have been requested, if the list becomes uiunanageab^ 
large, the key-words that are not used again after a prede- 
termined time period could be deleted £rom the list. How- ^ 
ever ts would be dedrable to keep the majority or all 
keyword phrases that are entered, if possible* 
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1ABIJE2 



list of infofination suppliers and a description of the 'wcb-paae 



Address 



Unkjne nnniber 
2-3 line for each Freqneacy tk URL 

description URL address (web page) is visited 



URL address 1 
URLaddiess2 
URLaddxessa 
URLBddie8s4 
URL8ddress5 
URLaddress6 
URL address?. 
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TABLE 1 



List of infonnatioo regoesis and the nnmbct of tunes it is reooegta 



Oimnlatfve onmber of times the key- Unique mnDber for 
Key^mrd wmd is rcqocsted (W) eadi loey^woid 

Key^nord 1 Wl, W2, W3 etc 

Key-word 2 

Key-word 3 

Kef^td 4 

Key-word 5 

Key^vvDfd 6 

Key-word 7 
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40 



Tbe cumulative number of times a keyword is requested 
may be segregated according to the different **users profiles" 
selected (Wl, W2, W3, . . . )^.g. Wl-total searches, 
W2-maIe profile, W3»Female profile, W4-USAprofile and ^ 
so forth. It should be noted that tbe sum of Ws will be 
greater that tbe total number of times a ate has been visited 
because tbe user may CaJl into than one profile category 
e.g. a male<W2) from the USA (W3). This wouM become 
a list of rK>t only tbe number of user seardiers using that ^ 
key- word but also a list of the type of user (according to tbe 
profile type selected) seardiing for that keyword. Keywords 
that mean tbe same thing in different languages are different 
keywords, as long as the spelling is different, although they 
could be related using the keyword suggester, as described 
hereinafter. 

Web-pag^ Table (18fl) 

Tbe contents of web-pagp table 188 of FIG. 4 are shown 
in more detail in Table 2 shown below, is arxl contains a list 
of Internet web-pages. Each web-page has a URL address, 60 
an associated 2-3 line description, a unique web page 
number for each URIj(whicb can also be any character, 
symbol code or representation) and the cumulative number 
of times the URL has been visited Tbe URL address will 
have a unique number (which can also be any character, 65 
symbol code or r^resentation) assigned to it rather than 
storing tbe full URL string in the subsequent data-Tables. . 



Kcyvrord URL Link Table (172) 

The contents of keyword URL lilt table 172 of FIG. 4 are 
shown in more detul in Ikble 3 shown bdow. This table is 
of particular significance with respect to the present inven- 
tion because it ocmtain mfbrmation about the inks t>etwectt 
informatk>n supplies (JJSL addresses or web pages) and 
information requests (keywords). 

This data is recorded in further data sets wfaidi describes 
the relationshq) between the Key-words and occurrences as 
defined by the foUowirig three paranieters. 

tbe cumulative rmmber of significant visits ^lits) to each 
URL ^kfaesses oorrespooding to eadi key-word (herein 
referred to as X or weighting factor This is a 
measure of the popularity of the URL for each keyword 
and is determine horn die surfer traces^ 

tbe previous cumulative number of si^inficant visits mea- 
sured at an earlier predetermined instant; (h^in 
refened to as Y or wei^ting factor Y) 

a date time &ctor relating to the instant of the creation or 
input of each said web-page(herein referred to as Z or 
weigbtmg factor Z). Z is the data time in which a 
web-page developer submitted a wd>^age to the 
search engine. 

Not all corhbinations of key-words and URL addresses 
will have data for X, Y and Z, 



TABLE3 



links Iwlween infonaatkm si^Uea (wel>-|uges) and infommtipa lequcsts 
Key-word Key^woid Key-word Key-word Key-wocd 



URL 


X,Y,Z 


addicssl 




URL 




8ddress2 




URL 




ftddiesB3 




URL 


x,xz 


address 4 




URL 




addiessS 




mh 




address 6 




URL. 




address 7 





Profile Type s with tbe Keyword URL Link Table 

The popularity of web pages will be different for different 
groups of people. The inclusion of multiple profile type s 
will produce multiple values of X Y aixi Z in Table 3, e.g. 
one may have a Global and New Zealand popularity rating 
denoted by XI X2 Yl Y2 etc 
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surfer trace is the combined informatioo from all individual 
surfer traces and it is used to detenniue bow many "bits" 

' (significant visits) each web-page had for eacb k^-word. 

Keyword ^BportB" jbe infdrmatiop collected from eadi individual surfer 

URL address idadDg to RDgby xi-520,X2-52 5 tracc is a SCTics of inputs previously described, and sbown 

URLaddrcss relating to BaskctbaU XI - 4000 X2 - 20 bcIoW in Table form 



In his example the global popularity (using the general 
profile type ) for the Rugby and Basketball URL addresses 
are 520 and 4000 respectively and 52 and 20 respectively fr>r 
the New 2^aland profile type. 

When the general profile type setting is used (ranked 
based on XI), the Basketball site would be ranked at the lop. 
When the Hew Zealand setting is chosen (ranked based on 
X:2) the rugby site would be highest. This would be a 
reflection of the preferences of the New Zealanders. This is 
a very simple method of storing the preference of different 
groups of people. One would expect New Zealand-based 
rugby web-sites to rate higher than an oveiseas site on the 
New Zealand list, but there is no reason that this has to be 
the case. Someone in Spain may have the best Rugby sate in 
the world. The system evaluates wethpages only on the 
peroeiyed quality of information by the users — the physical 
location of the site is immaterial. 

There could be a vast range of X values representing 
different countries, occupations, sex, age and so forth, 
enabling, the popularity of different groups to be captured 
very stmply. Useis could choose to combine any of the X 
values according to their personal interest^diacterisks. 

As an example, if say, 

XI is for maks 

X2 is for females 

X3 is for New Zealanders 

X4isfi)rUSA 

X5 is for engineers 

X6 is or lawyers . . . 

A^mak" and a ''New Zealander^ would using the search 
engine increment both X3 and XI. This fac^ty would 
increase the data requirement of the system but it could 
vastly improve the search results for different users. The 
total popularity of the web-page needs to be stored as a 
separate number as xtscss may contribute to more than one of 
the groups of people. The suin of all of the individual 
pcqmlarily's would be greater than the total popularity 
because user can belong to more than one profile type. 

To simplify the system for the user there would be a 
default profile type (selection of X's) with an option is to use 
other profile type s to do ^)ecific searches. For example, a 
user may have a default profile type of a New Zealand male, 
but if a technical seardi is required a **gIobal engineers'' 
profile type may be chosen that reflects the cumulative 
search knowledge of engineeis around the world. 

The extent of personalization could be dependent on the 
frequency of searching. For example, common keywords 
such as **oews'' would have a high degree of personalization 
(a large range of X values) and less common key-word such 
as "^iglisb stamps" would have little or no personalization 
(only a global X value). The degree of personalization could 
be a function of the frequency that the key-word is used 
(found from Table 1). 
Cumulative Surfer Trace Table (170) 

The contents of cumulative surfer trace table 170 of FIG. 
4 are shown in more detail in Table 4 shown below. 
Information about the links between web pages and key- i 
words in Table 3 ( also referred to as keyword URL link table 
172) is updated by the suifer trace data. The cumulative 



TABLE 4 



10 



Each row is one miier trace and the c 


xnnbtDed rows arc the camulative 


Boifcr b 


aoe 


IP Number User ID KcywDrd 


URL (webpage) DalMinie 



The way the surfer trace data is processed to update Table 
15 3 is described further hereinafier* 
Profile ID Table (166) 

The contents of profile ID table 166 of FIG. 4 are shown 
in more detail in Table 5 shown bebw. This table includes 
a unique identification^ password^ contact email and a 
20 default profile type which they normally use to perform thdr 
searches. 



TABLES 



25 





User identification Ikbte 


User 


Default Other 


jdeotifigirion passwoi 


xl email ' |nD&le tnficnmation 


JoeBloggf dog^ 


jblogg9@AOL US^Male 
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The users default profile type is stored as the part of the 
user's personal pie&renoes profile, wfaidb would accessed 
by entering some form of personal ulemificatioii to the 
system. Th^ infinmation could be so^Hed when logging on 

35 to the data seardi engine or the search engine oould leave a 
"cookie'*, as that term is known in the art, on the coipputer 
to identify a user, (there would be an optional e-mail address 
and password (or similar) associated with the logon 
procedure). The IP address itself would not be a suflicient 

40 means of identification as it is not necessarily unique to the 
individual users. 

The other information can include user defined prefer- 
ences for how the search results are combined and keywords 
that are of particular interest to the user. This information 

45 can be used to actively customize the search results and 
suggestions of web pages to visit 
Personal Link Table (174) 

The contents of personal link table 174 of FIG. 4 are 
shown in more detail in Table 6 shown below. Table 6 is 

so identical in structure as Table 3, and can be used to record 
a users personal preferences relating to each URL including 
the number of times visited and the key-words. In this Table 
6, however, Z is not the date that the web-page developer 
submitted the web-page by it is the date-time that the user 

55 visited the web page. This allow the users could refine a 
search by defining the last time they visited the web page. 

TABLE 6 



links between infonnatiofi supplieis (wcb*fiages) and infonnatioa requests 
fkcv-woids'^ fof an indjvidiiai oser 

Key-word ICcy-word Key-woid Kry-woid Key-word 

UKL z,y;& 
address 1 

URL x,y^ 
address 2 





Key-word 


1 Key-WDid 


Key-word 


! Key-'WDid Key-word 


URL 
tddress 3 










URL 
addicss4 










URL 
»ddic8s5 










URL 










addrcn 6 










URL 










ttddnsi 7 
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in Step 176, as shown in FIG. 4, web-page developers can 
TABLE 6-axitunied submit a URL 187 ad description 6S which also goes 

. ~ — directly into the web-pagc daU table 188, 

l^U^^„^^»^O^^^i„fo^^ web cnjrtcrs may also «UUM.add.esses and descrip- 

— 5 tion$(the descnptioniseitber the fiist few lines of the 

web-page or in the HTML coded "tille*^. Hiis is not an 
essential element of the system but it could be a method 
to obtain URL's and descriptions. With tins search 
system web crawlers are more likely to be used to 
verify the information rather than find new information. 
Populating the Cumulative Surfer Trace Table 170 

The cumulative surfer ace table 170, also referred to 
above, as Table 4, is populated each time a "tagged" web- 
page is selected by a user. This sends a packet of surfer trace 
information, such that the surfer trace data is added to the 

table eadi time the user selects another web page from a web 
page list 

The dato in Table 6 is only accessed by the individual that Populating the Keyword URL Link Table 172 
ctealed it, and accessible using a user ID that is preferably The daU from the cumulative surfer trace 170 is used to 
independent of changes in the user's e-mail or IP address update the popularity of web pages as lecmled in Table 3 
changes and would thus enable their past personal prefer- 20 (X; Y), also referred to as the keyword URL link table 172. 
ences to be retained during such changes* The 

Ibis l^le 6 data set covJd be stored either at the search frequency of iqxlating Table 3 with the data from the 
engine site or oo an individual's computer. Storiqg oo local cumulative ^irfer race (170) to obtain new values of X and 
PC's would require additional software to be irstalled on the Y is a variable that can be dianged, from ranges that are 
users comimter. There are numerous advantages to storing ^ shorter than every hour to longer than every month. It shouM 
the uiformatx>n at the seardi engine inchidirig the fict that ^ that different keywords can be updated at different 
iisers are likely to go there more often and unlikely to change mtenrals of time. 

search engines <Micc they have a substantial book mark Ikt ^ intermediary step in ptoccssmg the cumulative surfer 
Security Table flW ^ ^ ™™ * cumulative surfer hit table. This is 

IlK.contentsofsLrityUbkl(»ofFIG.4areshownin ^ f*^°«y modify the vahresofandX, Yin Table 

^tt™"^ 7 shown beknv. ensure that users do As mentioned above, the simplest method of recording a 
not subnnltte same tey-word over and over to link (-Hiseful visitor «hit^ between a keyw^ 

populanfy d)e foUowmg security data table is used. Each would be to count each keyword, URL paring in a surfer 
entryisasmgl6pieceofmformatk)nie.ycsorno.Thisteble trace as a "hit". A more meaningful and sophisticated 
can be created for Hnks between keywords and IP addresses ^ methodisonly to count a location selection as a valid if the 
or links between keywords and User ID's. user meets certain criteria. This criterion could be the user 

exceeding a specified time at a locaticHi. If (bis criterion was 
TABLE 7 not met, the selection would not be increase the cumulative 

— value of X in Table 3. 

SworityTiibic to ensure cmccomimier user dow not srf 40 It isalso possible to increment the vahie of X based on the 
^^^'^^^cpo^^ci^v^^ ^ at the web page. The k>nger the time spent the 

Key-wDidi Key^woid2 Kcy-wml3 Kcywoid 4 more this increments the vahie of X X docs not have to be 

— — ■ • — ^ a whole number. 

n^Idd!^ 2 ' Due to Ahe variations in web-site ca{)abilities in terms of 

iPaddrestS 45 log-on times, down loading times, bandwidth, and response 

iPftddiess 4 * j limcs, the predetermined time used to denote a valid 'hit' 

iPaddnagS 1 may be suitably altered Si)cdalist web crawlers may be 

■ " employed to independently validate such data. 

Described hereinafter are the processes that are used by , '"^ selection of a content provider's banner after a 
the present invention to populate each of the FIG. 4 tables ^ f.^X^^^ search counte as a hit for their web-page 

mentioned previously (mcrement the value of X). This wiU enable their web pages 
Populating the Keyword Table 164 possibfy go up the popularity list associated with the 

Thistableispopulatedcvery timeauserenteisakeyword T*^ mechanism to enable a web-page 

^ to the searchS-AsuLitted keyword is com^ ^^^t^^^nl'X nS^r?^^^^^ 

tothekeywocdfistinTablel(keywordt4lel64)andadded ^Jfe^^,"L?^^^ 

if it is not already present. If it is present, the cmnulative f^f^T Ume UK^re and record avahd 

numberisincreasedbyone.lftheuserhasaprofiletypethen ^^^^^ P;>Har 1^.^^ 

the cumulative num4 for the keyword f^each^ of ttt'^''^ u 

piofite will also be incremented (W1,W2 W3 ete) "^S. popularity 

^l^t^l^l.^'^T^^^^^^ 11. ^ "TOs'-nulativesurf^ 

This table |s populated m a mmiber of ways, mchidmg: i„ 3 ^^^^^ p^p^^^^ ^.^^^ 8 (be^ 

user selecting a URL address 126 that is not already in Grouping the cumulative surfer trace according to the IP 

Table 2 (URL table 188). the URL address 126 and addresses or user ID prodfuces the search pattern for an 

description 134 arc put directly into the web-page data 65 iiadividual users. This is a list of key-words and URLs and 

table 188. The new URL is assigned a unique identi- tiroes. This allow the lime spent at each web-page to be 

ficaU'on number. calculated for each user (it is not possible to calculated the 
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time spent at the lasi web pages of a search session as there 
is DO time record after they go to that web page) 

If the time between each visit is loiter than a obtain time 
period, one is added to the cumulative surfer hit (a) 
table for the key-word URL. (this is the simplest 
method, methods in which relevancy is proportional to 
the time spent at the site, for example, are ako piopctiy 
within the scope of the present invention). 

TABLES 

cunmlativc surfer hit trf>lc created from acccmmlated wirfcT traces 

Key-word Key-word Key-woid Key-word 

URL address 1 

URLiddici82 a a 

lJRLaddceas3 a a 

URLftddfe$»4 a 
URLwUicnS 

URL«ddicM6 a 
URL»ddR»7 a 

The cumulative surfer hit is used to update the value X in 
TO)ks 3 in the following way X^y^^. HPynt, 

HF is the history factor which is a umber between 0 and 
Llbe history factor does not have to be the same for every 
key- word and could be varied depending on the rate at which 
the keywoid is used 

Ibe data collected for Table 9 is used to recalculate the 
values of X in Table 3 after a predetermined time period. The 
fineqoency of updating Table 3 will influence the value of the 
Hislory ikctor (HF) draserL The reason for mult^lyiqg the 
existing X by a ''history fitcto" is so that the perceived 
pofmlarity does not last indefinitely. The history factor 
reduces the weighting attached to the past popularity. To 
illastrate by way of an example, the key-word "sports news" 
may have an existing popularity with the following ranking 
(ba^ed on the number of hits per web-page, X) 



The reason for the change in the number of hits reflects 
the feet that the winter Olympics has finished and the Master 
golf tournament has stated. If one has a 'iiistory factor^ of 
0.9 then the new pc^mlarity (X) will t>e: 
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1 Golf news wcb-psge 

2 Socoef results wcb-psge 

3 lA^tci Olynqncs wid>-page 

4 Baseball results ivcb-pa^ 



18100 (0^ X 15000 -I- 4600) 
18000 (0.9 X 18000 ^ 1800) 
17600 (0^ X 19000 4 SDO) 
15000 (0.9 X 15000 ♦ 15O0) 



1 Winter Otyn^ncs we)>fftge 

2 Soooef Rsubi wtb^phgR 

3 BasdnU icsoils wcb-page 

4 Ocrff news wd^■page 



X-19000 
X- 18000 
X-15000 
X-15000 



The cumulative siirfer hit Table for a wedc may be: 



1, Winter Olynqncs webioge a - 500 

2. Sakxer lesulto web-page a - 1800 
3^ BasebsO results wcbfage a <- 1500 
4, Golf ocws wcb-pttge a - 4^ 



Thus, the more popular web-pages can emerge and the 
less, popular decline, reflecting the fluctuaticm of interest 
j5 over time in different subjects and events. 

The database is therefor utilizing the human mini lo 
provide a powerfiil indication of what people find usefiil on 
the Internet The users themselves replace a substantial 
computatipn requirement that would otherwise be required 
20 to filter through sudi searches. 

Hie vake of Y in Table 3 is the old vahie of X, and the 
value Y will be iqxlated at intervals that are deemed 
appropriate, whidi interval could be minutes, hcmrs, days» 
weeks or longer. Ibe update interval does not need to be the 
25 same for all different keywords, as previously mentioaed. 
This is used to calculate the rate of change of popularity of 
web pages and can be used as a selection criteria. 
Different Profile lype s in the Web-page/URL Enk T^le 
50 The cumulative surfer trace includes information on users 
profiles so Tible 8 can be calculated with subscripted values 
of a for different profile types. These values of 043 gl, etc 
would corre^nd to the profile types for the subscripted 
values of X This allows the popularity of different groups of 
35 people to be recorded. 

New Web-page DaU input to the Wsb-pageAJRL Link Table 
172 

The simplest method of having new pages recorded by the 
search engines is for web-page developers to submit 
information, shown as action 176 in FIG. 4, which irifbr- 
mation inckdcs URL 66, key-words 70, site descriptions 68, 
target audience 72 and date-tie 74, each time they create or 
update a web^age. 

This information directly updates Tables 2 (URL table 
188 of HG. 4) and 3 byword URL link table 172 of FIG. 
4). The URL 66 and description 6S are entered in Table 2 and 
the date-time (74) at whidi the page is submitted (the Z 
value) is inserted in Table 3 for each of the keywords (70). 
Users are allowed a set number of keywords 70 with whidi 
they can submit their web page. An example of what Table 
3 would look like with just Z values is given below (format 
dd-mm-yy). 



40 



45 



50 



TABLE 9 





Data l^blc oeated from submissia 


D bv web develoDcit 


Key-word 


Key-wDid Key-wocd Key-word 


Key-word Key- word 


URL address 


27/02/98 27/02/98 






URL address 


28/02/98 28/02/98 


28/02/98 




URL address 








URL address 


18/02/98 


18/02/98 


18/02/98 


URL address 






URL address 




28/02/98 




URL address 


29AJ2/98 
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If there is no date for the oonibination of the URL and 
keyword in Table 3, then the new date is automatically 
inserted. If a date already exists io the l^le, then the dates 
are compared and if the dates aie too close, ifi, less than a 
pre-determined period, then the old date remains and the 
new dale is ignored This stops people from constantly 
resobmitting to get on the top of the new web page list by 
resubmitting their web pages. If the URL in Table 3 hasother 
ixywoids with values of Z doscr than the pre-determined 
period then the submission is also not allowed. This stops 
web-page developers from resubmitting their web pages 
with different sets of keywords. 

When users submit a URL they could target it at specific 
types of users (different profile type s Zl, Z2, Z3 etc) as per 
Table 3. For example, an URL submission specifically 
targeted at New Zeahmders (eg. 21) will 2^>pear at the top 
of keyword new list when New Zealanders search for that 
keyword. It will remain at the top until someone eke submits 
a URL for that keyword targeted at New Zealanders. URL's 
that are targeted at other audiences will not appear as new 
dtes for New Zealanders or alternatively they will not 
feature as higb in the new list as the ones specifically 
targeted at New Zealanders^ 

The data on new web pages does rK>t necessarily have to 
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Pdpulatirig the Personal Link Table 174 

The cumulative surfor trace is used to identify the seardi 
patterns of individual user based of sorting by User ID 126. 
Tliis information is used to iq>date the personal link table 
174 in the same way that the cumulative surfer trace 178 is 
used to update Table 3 (keyword URL link Uble 172). This 
table stores users past preferences as a form of automatic 
book marking, 

Populatir^ the Security l^Ie X6S 

Each time a user enters a keyword 52 into the search 
er^gine it updates the security table 168 (Table 7) by making 
a link between the keyword 52 and the IP address 62 (or 
rnakixig a link between the keyword 52 and the User ID 56). 
The dau iii l^k 7 is cleared periodically as the purpose is 
to stc^ systematic repeat searchii^ from affecting the popu- 
larity lists (vahie of X in Tible 3) rather than stopping 
individuals who occasionally perform the a rq)eat keyword 
search firom afEecting the popularity 1^ 
Determining the list of Web pages 

FIG. 5 shows the various daU sets previously described, 
and various irqmts and actions that result in a list of 
suggested web pages being provided, and win be described 
in rnore detail hereafter. As shown in FIG. 5, user data 
entered into the seardi engine can include: keyword 52, user 



M u%« uw*» rww ya^^ ujva uui imxsiiaruy nave to «^uu9cu luio uK scaicn engme can mciude: Keyword 52, user 

be entered by web-page devek>per& It oouM be automated 25 ID 56, search type 58, IP address 62, profile types 54. How 
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by havmg a web document template that, automaticaDy 
submits data to the search engine whenever the information 
on the wd>-page has been significantly changed. It would 
prompt the web-page developer to change any key-words as 
appropriate. 

Another embodiment requires sendirig specialist craters 
. out to find web site addresses and key-words, thou^ this has 
many of the drawt>acks of existing web-crawlers. It could 
only be effective if web designers dehl^erately configured 
their page with the key-words identified. Any web site 35 
designer4>roprietor willing to do this would also presumaUy 
be willing to submit any iqxlates to the search engine to 
benefit from the instantaneous fisting on the search results. 

An extension of this principle is to auto-detecH if a web 
address possessed key-word information in the database and ^ 
then automatically send an invitation to provide the infor-p 
matibn to enable itcir web-page to t>e found easily. The ideal 
number of key-words to be submitted with each web-page is 
preferably less than 50 and probably preferable within the 
range of about 5 and 20. This also advantageously forces 
web-site designers to firnl the most appropriate keywords to 
describe their site and also enable them to choose the 
audience they wish to target. 

The web-page submission process may also include web- 
page developer identification process that restricts the ability 
of people to use the system fraudulently. This may include 
a payment to prevent multiple web-page submissions. 
Populating the Profile ID Tabk 166 

ID Uble 166 of FIG. 4 is populated from the direct inputs 
from users. When users search the can choose their profile 
type 54 from a layered drop down memi, which could 
include, for example: 

Gender (Male or Female) 

Oociq>ation (Professional, student etc) 

Age category etc 

The user selects different profile types from the options 
they are prompted if they wish to save this as their default 
profile type. This is then recorded in Tabic 5 (profile ID'S 
table 166). The user may also select personalization optK)ns 65 
from a specific personalization options page rather than a 
drop down menu on the search page. 
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this daU can be used to determine a list of web pages 250 as 
wen and deciding which of the fist of web pages to tag (step 
lU of FIGS, i) for the purposes of creating a surfer trace is 
described berdnafier. 

The numbers (X, Y and Z) in Table 3, ^i^iich correspond 
to keyword URL Hnk table 172 in FIG. 5 coiAtaa all the 
tnfonnation required to give the foUowir^ types of searches 
58r 

Popular-list search ranked hit-list of the most popular 

URLs for that keyword based on the number X 
Hot off the press search ranked hft-list of newest URLs for 

the keyword based on the date/time (Z) 
High-flyers search ranked hit-list of best emerging URLs 

based the difference between X and Y 
Random seardi hit-list that is a random sample of URLs 

that have any of the numbers X, Y or Z 
Date created search this is hit-list based on the date time 

Z and the user-specified date of interest (not just the 

newest) 

The personal Hitk table 174 also allows past preferences 
to be listed as seardi results 

Previous favorites search is a ranked hit-list base on the 
previous popularity for the individual (X from Table 6). 
This search is based only on the previous searx^ing of 
the individual user. This allows the users to very 
quickly find ^te that they have previously visited. 

A number of other search options are also available. 

C6nventic»al search is We list of search results from a 
normal search engine (ll6 FIG. 3) 

Other content only search. This is a Est of other cootent, 
such as advisements^ associated with the key-word. 

These search results can be combined in a mimber of 
different ways 

Cbllective search ranked hit-list that is a collection of any 
of the search bit-lists described above (this is the 
default set of search results) 

Custom^ed search ranked hit-list that can be a user 
defined combination of any of the above lists. 

FIG. 5 also illustrates the use of keyword dbia 164 and 
security table 168 in a decision 246 to send out tagged web 
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pages. This deaaon is based upon the frequency of key lated. An alleraative method would be to rank the rale of 

word usage, the data in the secmity table and the presence diange of popularity by the number of places they rose 

of a user identification. The details of the decision to send compared to last time. 

out tagged web pages is described fiilly in FIG. 16. TTie formula of caliailatii^ the rate of change of popu- 

How the Different Types of Search Lists are Implemented 5 larity for this embodiment ts given by: 

More details on how eadi of these types of seaichcs is 

implemented is provided bdow along with some of the ((^-^^WCf-P)) 

advamagc and disadvantages of each. The systOT relics on where is the maximum value of X for the corresponding 

the bram power of the user, diis time to determine what sort key-words and p is an additional variable that canbc 

of search tbey want to do which will depend on what they lO changed to alter the relative significance of changes at the 

want to find. The search methods are described easily so top and bottom of the popularity list 

useis should intuitively know which one to use. The reason fi)r mult^lying by the maximum vahie of X is 

^^"Si^ ^^^ ^ to ensure that small changes at the lower popularity levels do 

FIG. 6 tOustrates the process for deteraiining a list of not swamp more significant changes higher up the table. For 

p^darwebpagpsassodated with the entry of a keyword is example, a wd) site having previously recorded only one 

270 m step 272, If this seardi is selected and a keyword is selection and then attracdng 5 hits the next day would 

entered, stq> 274 foltows and produces a list of web pages exhibit percentage increase of 500% whilst another web- 

based on the vahies of X taken from 1>ble 3 (172, HG. 5) page may have experienced an increase from 520 hits to 

for the keyword 270 entered. These web pages are identified 4000 hits (a much more s^g^ificant increase) though this 

by a unique web-page(URL) number from Table 3. 20 would otherwise j^ear as a tower percentage increase 

Thereafter, in step 276 the list of webi»agenumbeis found Random Search 

from step 274 is combined with the URL address and TTiis is a random selection of fcssi)opular webi>aces for 

web-page description from Tabk 2 (IW FIG. 5). In step 278 the user that want to kwk at wcb^ages off the beaten trade, 

the rcsulbng list of web pa^ is then teg^ depending on based iqx>n a random selection of web pa^ that has any 

thcresultsofstep246inFIG.5asdescribedprcviously,and 25 value of X, Y, and Z associated with a keyword that is 

sent to the user for them to make their sekctions. Hot off the entered. Accordingly, after a user enteis a keyword in step 

^J^^^t^ , ^ ^ 352 as indicated in FIG. 9, reference is made to the keywoid 

nG.7ilhistratesthcproccssfordeteiniiningalistofncw URL link tabk 172 ilhistrated in HG. 5, and a random list 

web pages associated with the keyword entered in step 290. of web pages numbeis are generated automatically using a 
If this seaidi IS selected and a keyword is entered, step 294 30 random number generator are determined, as iDustiated at 

foUows and produces a list of web pages based on the vahies stop 354. Only web pages that have values for X, Y or Z 

of Z taken from Table 3 (keyword URL link table 172 of assodated with the key word are chosen in this random 

FIG. 5) for the keyword entered m step 290. These web selection as this indicates that at so^e stage in the past as 

p^cs are identified by a unique web^page (URL) number used or web page devek)per thought the web page had some 
from Table 3. 'niercaficr,m step 296 the list of web-page 35 connection to the keyword. Thereafter, in step 356 the list of 

numbeis found from step 294 is combined with the URL web-page numbeis found from step 354 ts combined with 

address and webi)a^ deso^tion from Tabk 2 (URL table the URL address and web^age descrq>tk)n horn T^ble 2 

188 of HG. 5). In step 298 the resulting list of web pages is (188 HG. 5). In step 358 the resulting Hsl of web pages is 

then ta^ed depending on the results of step 246 in FIG. 5 then tagged, dqxsndingon the results of step 246 in HG. 5 
as described previously, and sent to the user for them to 40 as described previously, and sent to the user for them to 

make their sekctions. make their selections. 

The user will also be abk to sec exactly when each Conventional Search 

web-page was submitted so hitemet useiscan be aware of its This is the normal search method of a conventional search 

currency. An mdirect consequence of this feature is the epgine, referenced as other search eiigine 116 in HG 3 

mccntive for web dcagners to update theirsites. The promi- 45 which may or may not be inchidcd ak>ng with the searches 

nenoe given to iiew and i^xlated ates provides a means of according to the present invention, at the option of the user 

becoming established OD the popular hit-list and encourages as noted previously. * 

the use of ^ropriate key-words and rewards the up keeping Content Only Search 

of web pages that users find usefid. This is a list of content, such as advertisements, associated 

*^;?y5?^^^^ SO wit the key-word, ^ch the user cannot control. The ones 

HG. 8 ilhistrates a high-flying web pages search assod- that have paid the most wiU be at the top of the Ifet, as 

ated with the keywwd entered in step m This is a list of described fiirthcr hereinafter, in accordance with the pre- 

wcb pages that ate increasii^ in popularity fastest If this ferred embodiment of the invention. Of course, other sys- 

search is selected and a keyword is entered, step 324 folfows terns for identifying the order of paying content providers 

and produces a list of web pages based on the relationship 55 can also me implemented, 

between the values X and Y taken from Tabk 3 (172, FIG. Previous Favorites Search 

5) for the keyword 320 entered. These web pages are FIG. 10 illustrates a previous past favorites search that is 

identified by a wrique web-page (URL) number from Table based only on the previous searching of the individual user. 

3. Thereafter, in step 326 the list of web-page numbers found This allows the users to very quickly find sites that they have 

from step 324 is combined with the URL address and 60 previously viated and performs, therefore, automatic book 

web-page description from Tabk 2 (188 FIG. 5). In step 328 mariting. It should be noted that since a password is pref- 

Ibe resultinjg list of web pages is then lagged depending on erably used to logon to the search engine system according 

the results of step 246 m FIG, 5 and sent to the user for them to the present invention, the user will be abk to access their 

to make their selection. personal preferences from any computer. 

The high-flyer list is calculated by comparing the old 65 Thus, when the user types in a keyword at step 372 as 

popular ranking (Y) and the new popular ranking (X) from indicated in FIG. 10, step 374 follows during which il is 

Table 3. From this the pcrcenUge increase in hits is calcu- determined what are the favorite sites (based on previous 
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usage) for that keyword firom the peisonal link table 174 beis found from step 404 is coinbioed with the URL address 

iUitstrated in FIG. 5. Because the user has a password that and web-page desoiption fironi Table 2 (18S FIG. 5). lo stq> 

can be used to bgoa to the system the user wOl thus t>e able 438 the resulting lit of web pages is then tagged, depending 

to access their personal preferences form firom any com- on the results of step 246 in FIG. 5 as described previously, 

puter. 5 and sent to the user for them to make their selections. 

Due to this search capability diere is, therefore, no need Customized Search 

to manually bookmaik web page& If a user foigot to FIG. 13 illustrates a customized search that allows the 

bock-mafk a good site on, for example, 'marbles', they can user to decide how they want their default hit-list to appear, 

easily find it by retyping the keyword that lead them to that In stq> 4<k2, the keyword and User ID is selected in order to 

site. If a user's preferences change they will be reflected in 10 initiate the customized search. Prior to initiating the cus- 

the perscmal links table 174. tomized search in step 466, ^liiich step is icfentxcal to stq> 

Another embodiment of the personal preference search 404 of the collective search previously described with 

iodudes specifying the date the web page was list vi^ted, re^wct to FIG^ 1, however, step 464 is applied to customize 

with or without usii^ a keyword. The web pages are then the users default mixture of hit-lists For example a user may 

ranked based on Z in peisboal links table 174 of FIG. 5. For is want their default search results to include only popular and 

example if a user looked at a site in the middle of last year new web pages but no high flying web pagesw This custom 

the user can refine the search by date, thus making it easier search is then performed in stq> 466 to generate a list o;f web 

to find a previously useful web-pagcs more easily, even if page numbers. Thereafter, in step 468 the list of web-page 

they could not lemember the relevam keyw(»d numbers found fitom stq> 466 is combined with the URL 

This automatic book-maiking feature can ako act as a 20 address and web-page description fiomlbble 2 (188 FIG. 5). 

device for monitocing the type of Internet use being uixl^- In step 470 the resulting list of web pages is then tagged,' 

taken by a partknilar computer and thus for example, can depending on the results of step 246 in FIG. 5 as described 

provide warning to parents/employers of children/ previou^, and sent to the user for them to make their 

ernployecs accessing nndesirabk sites, such as adult web- selectioos. oite preferred embodiment, the makeup of the 

page& In a preferred embodiment, for parents/employers 25 default search results list can be amended by 'learning' bom 

unlikely to use the computer themselves, notification of such the use's behavior to create a changing customized search 

usage 15 automatically provided by letter to the parent/ based on the user's own search patterns. If a user consist 

etDpk>y^ that lists the keywords selected and web pages tently chooses new web pages or high-flying wdb pages for 

vishedbythedxi]dreri/crripk)yees.T1iisinformatk>nis example, ten their set of de&ult search results wiQ t>e 

directly horn each us^ table 174 of FIG. S.Tliis requires a 30 changed to reflect dieir normal search style, 

user identificatioii code that also iridudedparental/einployee Magazine Search 

information. The rnagaziix seardi according to the preseiit invention 

Cbllective Search enables useris to search by foUowing a series of merra-driven 

The collective search, as illustrated in FIG. U, is the subject choices (or siniilar hierarchical stnicture), rather 

default search according to the present invention and is used 3S than entomg a spcd&c key-WDnI(s). 

when the user does not actively choose on of the other search Existing magazine-style search engines require editors to 

options set the structure of information, decide on its relevant merits 

Upon entry of a keyword in step 402, that keyword is used and set the criteria, such as price, for ^ce on a given page 

to select boin a combination of web page selections asso- transmitted to the user/viewer. Using the seardi system of 

ctated with that keyword. Ashown, for example, in step 404, 40 the jvesent inventK>n, the users' thepiselves dynamically 

an equally weighted combination of conventional, popular, decide what is and is not worth seeing. Tfaus^ although 

highfllier, new and past search results is used to obtain a list editorial input is needed regarding a hierarchy of subjects, 

of web page numbers. Thereafter, in step 406 the list of the web-pages that emerge as the most popular for each of 

web^page munbers found firom step 404 is combined with these subjects will evolve automatically, 

the URL address and web-page description firom Table 2 45 Use of Data Sets for Different Groiqis of People 

(188 FIG. 5). In step 408 the resulting list of web pages is Different popular hit-libts may be employed to provide 

then tagged, depending on the results of step 246 in FIG. 5 results which would reflect different cultural, geographical, 

as described prevx>usly, and sent to the user for them to professional, gender or age interests. Thus, as shown in FIG. 

make their selections, the system is first configured, the 14, when a user enters a keyword and User ID in step 490, 

search engine 10 database will rx>t posses any information 50 the default profile of the user can be used to reflect the type 

on popular, high flyers and new web page hit-lists, so search of web pages that people of the same "group** as the user 

results win initially be obtained from the conventional profiles desire to see. Thus, the search that takes place in stq> 

hit-Hst (normal seardi ei^ine), and the tagged web pages 494 is based on the subscripted X, Y and Z values obtained 

then used to create the database sets as have been described. firom the default profile of people of those ''group" affilia- 

As the system develops, the data sets associated with each 55 tions identified in the user's personal profile obtained in step 

of the other search types will become populated, aixl 492. Thus, the rather than an overaQ global search result. 

Searches using the other search types will become more search results are obtained particularized for the group that 

useful. the user identifies with. The resulting list of web pages. 

Date Created Search derived firom steps 496 and 498, as have been previously 

FIG. 12 illustrates a date created search at allows the user 60 described^ are particularized for that group, 

to select the date that the web-page was submitted. This Tbus» for a particular user with the profile type New 

, feature will only work for wel>-pages that contain a date Zealand selected as a geographical factor, a search for team 

created data entry, identified as date-time submission 74 in field ^rts and related key-words, rugby material might 

FIG. 4. Upon entry of a date-time and/or a keyword in step figure prominently, whereas an American profile type may 

432, the search engii}e 10 will perform step 434 in which a 65 produce a bias towards baseball/American football material, 

list of web page numbers associated with these variables is for example. This technique offers the ability to discriminate 

obtained Thereafter, in step 436 the list of wct>-page num- between the different meanings of the same words, accord- 
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iog to the context of the popular hit-list associated with a determining which seb of results get sent out with "tagged" 

particular profile type. A general search using a key-word web pages. . Reference with jcspcd to this was already 

'accommodation' for example would indude results related mentioned with respect to authenticatiiig user's of a particu- 

to housing, renting and similar whereas if the user indicated ^ ^ S^^» doctors ixi the example provided, 
an interest in optometry in their profile type, then the term 5 As shown in FIG. 16, after entry of keywords and other 

'accommodation' would be interpreted quite differently. <^ ^ t^ic ^ three decisions that determined 

Ihe relevance of such sites will evolve . automatically, whether results are actually "tagged" as has been previously 

without any active evaluation of the sites by the search described in step 118 of FIG. 3. 

engine operator or the tiser. There aire no complex olgorithm ^ shown by step 556, for a user that has a user ID and 

required to analyze the relevance of web-sites for particular lo chosen to use the peisonal h'nks table 174 of FIG. 5 

types of USCTS. Instead, the type of site deemed relevant will (Table 6) as previou^y desoibed, it is necessary to "Ug" all 

be decided by those users selecting those characteristics for results so that all of their past preferences are 

their inofile type , Le. American females interested in recorded in their personal links table 74. search engine 

rock-climbing. Sites of greater relevance will naturally ^cording to the present invention system can update the 
attract more hits, increasing their ranking and thus increas- 15 ^r's personal prefererxxs but noi update Table 3 if certain 

ing the cfaaiKX of a subsequent user also investigating the security levels have not been satisfied (see below), If, 

site. In the above exainple, any web sites iKted for the however, the personal link table 174 is stored on an indi- 

keyword 'accomodation' which were unrelated to vidual's conqmter rather than at cwitrallocation there is no 

optometry, agbt, lens, vision, etc., wcmld not be accessed for ^ tagged results as the daU is stored locally, 
the period of tone required to make a valid it. It would 20 ^ shown by step 55S, when a keyword is submitted, a 

therefore receive a very low ranking and hence be even less check is made that the IP address 62 has not already searched 

likely to be accessed by Her users. tbe keyword using security table 168 (Tabk 7) before the 

The user can select differmt profile types for diffmnt ^ tagged results. If so, the user can still 

searches during a angle session and isiK)t be restricted to the undertake the seardi though it will ndt contribute to the 
default profile types. 25 ^^^'^^^^^^^^^^l^O (Table 4). This allows all normal 

In a fbrther embodtuKnt of the invention, there can be ^ ^ed the pqmlar hit-list and all users to search 

included a level of authentication for person's of a certain whatever they would Kfce, but prevents fiaudulent users» 

group to have their search results ally be used forpurpoeeof such as ^>ammers, from contributing to the popular hit-list, 

updating the database relating to that group. For example, security table 168 can also include information on Imks 
doctors who have a user ID that identifies them as doctors 30 I'^tween keywords 52 and a user ID 56 to detect repeat 

may perform a sean^ related to a certain medical oonditioQ, searching. 

and their selections can be tagged aixl used in the database While it is possible for user's to change the IP address of 

for that groi^ of doctors as has been previously described. computer, this is also detectable and preventable by a 

However, although patient's may desire to identify their number of methods such registering and traddng the use of 
profile with that of the same group of doctors, their selec- 35 ^ numbers. 

tions are not as significant as those of the actual doctors, and Other methods to exclude false searches include: 

thus while they are able to vkw the web page listings that Only creating a surfer trace for users with a user ID 554 

doctors deem most pertinent, their selecticms are not used to recorded with die search eogine. 

update the doctor's group database, ance their IDs do iK>t Extending the time lunit requited to make a visit count as 

identify tfiem ats a doctor 40 a usefiil tut 

Limiting Search Options Do not count angle visits to a URL from a keywoid (for 

Another feadue of the present invention is keyword whidi there is no means of measuring a lapsed-time). 

eliminatOT feature, which is illustrated in HG. 15, and As shown by step 560, popular keywords can be traced 

prevents certain users such as children, from searching for once every tenth, hundredth, or even thousaixith occurrence, 

undesirabfe keywords and web-pages when the keyword 45 and the frequency of this selection can be changed to 

eliminator feature is turned on. The present inventor's have optimize the system. The frequency of keyword usage is 

realized that it is potentiaUy much easier for example, to stop determined fipom keywrad table 164 as shown in FIG. 5 

cfaikhen searching for poroogr^hy, rather than attempting (Fable 1). The firequency of sending out tagged results can 

to trace and prevent access to all sites on Ihe Internet with also be linked to the rate at which popularity is changing for 

pornographic content Hiis would be used as a complimen- 50 different key words. For example the keywml^IBM'' would 

tary tool to existing "net nanny" type devices. Thus, as probably have IBM's home page at the top and most user's 

shown in FIG. 13, with the keyword eliminator turned on, a would go there, whereas the key woid "latest fads" may 

ineexisting table inaccessible keywords is stored in a Uble have a oonstaoUy changes set of web pages that needs to be 

and compared in step 522 with a keyword previously sampled more frequently. 

entered, as shown by step 520. Thus, keywords that are 55 To avoid the keyword URLlink table 172 of FIG. 5 (Table 

inaccessible will not be searched Thus, for example, parents 3) from becoming unduly large, one method is to only 

could choose the types of keywords 552 that they do not register keywords in Table 3 once they reach a certain 

want their children to search for— and this will be different frequency of usage. This is controlled by not sending out 

for different sets of parents. The ^em filters out the tagged rests for less frequently used keywords (found from 

keywords that may be used for subsequent searching in step 60 Table 1). 

^* Active Suggestion of Web Pages to \^'t 

Determining Which Users to Sample Another feature of the present is illusUated by FIG. 17, 

FIG. 16 illustrates the process of determining which and involves using data to actively suggest web pages. TTiis 

search results should be sampled and used to make up the is different from a search because the user sets up the request 

cumulative surfer trace table 170 of RG. 4, also referred to 65 and is informed if there is any new data on the subject. To 

as Table 4. While possible, it is not necessary to collect data do this the users has to actively ^>ecify which keywords they 

iconceming every single search, and this can be controlled by arc interested in and the profile type that they would like to 
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act as a filler or agent and the search type (new, highyfiying, 
popular) in sXcp 588. This infonnadon is stored in the user's 
profile ID 166 shown in FIG. 5 (Table 5), 

Thus, at various interval's the user receives a list of 
suggested web pages detennined by a group of like minded 
humans. For example a user may dioose to be notified of 
web pages with the following. 



Keyword SSZ 
Rugby 

Deny tnstiuciits 



profile type (agent 588) 
New Zealaiid^ Male 
Dentist 



Scaidi type 586 
flying 



10 



. This way if there are highflying web pages on ''nigt>y^ 
that other New Zealand males found usefiil (Le. they ^>ent ^ 
a significant amount of time looking at the information — 
high rate of change of X in Table 3) the user would be 
notified. SimOarty if theie was any pew informatioa on 
"decay treatment^ submitted for dentists to look at, the user 
would be identified about it (value of Z m T^le 3). It is 20 
onlikcfy that a computo' agent win ever be as good at 
filtering infonnation as a selected group of peers. An advan- 
tage of this system compare to other ''agent type*' software 
is that this does not require any software on the user's 
computet: It is all inchided as a natural extension to the other 25 
seaich engine data sets. 

Tbe suggested web-sites can be displayed £[>r the user 
when they next access the search engine or they may dioose 
to be notified of these suggested web pages via e-mail 
notification. This way web pages can be drawn to the user's 30 
attcotioa without any active searching for these keywords. 
Pas»ve Suggestion of Web Ihkges to Vi^ 

Another feature of the present is iQustrated by FIG. 18, 
and involves automatic wi^>^age suggestion based on how 
the user has searched in the past and requires no active input 35 
firom the user. 

As shown, in step 620, upon the eiatiy of a user ED, the 
system can be activated passively, at various intervals or 
times (such as at each login to the search engine^ by looking 
at wtucfa so keywords, profile types and search types, the 40 
useis fiequeotly looks at u^g the personal links table 174 
of FIG. 5 (Fable 6). For example, it may be that the user 
frequently looks at Rugby infomiation as a ''New Zealand, 
male" and looks at decay treatments as a "dentist". This 
infonnatton can be found from the automatic book marking 45 
table, previously referred to personal links table 174. If the 
user has not looked at these subjects for a certain length of 
time and there are new or highflying information sources, 
the user will be automatically notified of these new infor- 
mation sources. 50 

In a modification of this embodiment, a periodic e-niail 
can be sent out with the two newest aixl highest flying sites 
related to the key- words of the user. 
Determining a list of Suggested Keywords 

A problem with Internet searching for Many users is 55 
knowing which key-word to use for searching. While the 
present inveutioo could be implemented with au infinite 
number of keywords, too many key-words (includes 
phrases) that users choose can be problematic. 

Accordingly, as shown in FIG. 19, tbe present invention 60 
also provides for a data set 642 that provides synonyms for 
the keywords entered along with tbe particular profile type 
in step 640. Tbe system represented in FIG. 19 is referred to 
as a key word suggester. This is implemented, in one 
embodiment, by matching the key-word entered by the user 65 
in step 640 with the existing keywords and phrases in 
keyword tabk 164 of FIG. 5 (Table 1) that other users have 



tried u^g other search methods, identified in step 646. Each 
keyword is then tagged in stqp 660, and those that are 
selected by a user in step 662 are used to form a keyword 
surfer trace 648 as shown in FIG. 19, which contains the 
original keyword 52 that the user entered, the keyword 
selected 652, and the IP address 130, user ID 128 and 
date-time 132 data as in the previously described web page 
surfer trace. 

The data from the cumulative keyword surfer trace 648 is 
then used to reinforce links between keywords. In this way 
the system learns which keywords are associated with each 
other. The system learns which words are related to each 
other in the same way that the system learns which URL's 
are associated with the key-words. The lists of suggested 
keywords will become more relevant over time as the 
relevancy is improved eadi time the keyword suggester is 
used. 

Oeattng Data Sets that Determine the Suggested Keywords 
As shown in FIG. 20, a keyword link table 696 and a 
cumulative keyword trace table 698 are used along with tbe 
previously described security table 168 to create the data 
sets for suggested keywords. The key-word link table 696, 
shown in Table 10 below, records bow ofreo each key-word 
IS selected from the suggested key-word list This can then be 
used to rank the of the usefiilness of different key-words 
relative to each other. 

lABLE 10 



Key-WQcd 
1 



2 



Key-wont 
3 



4 



Key-word 
5 



Key WDid 1 
Kcf-wonl 2 
K^WDidS 
Key^woid 4 
Key-wont 5 
Key-woid 6 
Key-WDid 7 



20 



— 1134 
356 — 



20 



168 



It can be seen from the Table 10 that people who entered 
key-word 2 found key-word 3 the most useful followed by 
k^rword 5 then key-word 1. Tbe keywords can have a 
directional aspect, for example, keyword 3 was found useful 
1134 tioctes after trying keyword 2. However keyword 2 was 
found useful only 356 tries afier users tried key-word 3. 

Information about tbe links between keywords in Table 10 
is updated by the information about how people are using 
suggested keywords (keyword surfer traces 648). The cumu- 
lative kejrword surfer trace 698 is the combined information 
from all individual keyword surfer traces 648 and it is used 
to determine bow many *^hits " (significant visits) each 
keyword had for each key-word. 

The information collected from each individual isurfer 
trace is a sees of inputs become a cumulative keyword surfer 
trace, shown in table form bek>w in Table 11. 



lABLEll 



Keyword cumulative surfer trace 



IP Number 



User ID 



Keyword 
(origiiial) 



keyword 
(sugeested) 



Date-time 



FIG. 20 also illustrates how links between keywords in 
Table 10 can be initiated by recording sequences of key- 
words that users put into tbe search engirte. If for example 
someone searches using the keyword "NHL" and then 
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"National Hockey League", this would then draw an asso- indicates that these keywoids are similar. For example a user 
ciatioD between these two kcy-wofds in Table 10 by record. may search for "film reviews" and the keyword suggester 
ing this as one hit Again this captures the rcasoniog power inay come up with "movie reviews" which has a more 
of users to define the link between two keywords. Often the comprebenavely sean^ list of sites. In this case there is 
kcywoidmsequencewinbctotaaynnrelatcdtothcpicvious 5 no phyacal similarity between the words movie and fihn, 
key-word but sraaetimcs it wiU be relevant If the next user but they arc linked by the similarity of the patterns of URLs 
chooses It firom the key word selector it will reinforce the links they have in common in Table 3. 
key-word link in the same way that repeat sclectwn to web The usefulness of the key word suggester list is enhanced 
pages remforces links between a keyword and a URL. indicated by step 744. by associating with eads kcy-woid on 

The foUowmg is an example of keywords that may be lo the suggestion list an indication of whether there are any of 
suggested after entering the a simple key-word like "Book" die aforcmendoned searches available (popular, high flyer, 

book sales etc-) for that key-word in keyword URL links table 172 of 

book reviews FIG. 5 (Table 3). The keywords with the most search results 

^)ecialist books are then highlighted. 

second hand books Decision to Send Out Tagged Keyword Suggestions List 

. , The security table 16S and keyword link Uble 696 arc 

* . . ™?. ^ used to determine which keyword Knks to sample in a 

^)ecial edition books manner similar to that prevfoudy described with respect to 

AU of these key-words (pta^)wouklcoinefroniinfbr- taggwebp^es.As with the dedsicMifi)r tagging web pages 
MtiooseeteK (users) and inf<^ 20 this can depend on whcthw it ^ a repeat keyword (found 

devekypers). The most appropriate keywords will em^e froin security table 1€«) and on the fiequency of keyword 
natan% overtime. usage (found from keyword table 164X as wefl as the 

All keywords used by users are entered into the key-word conskleratbns prcviousiy discussed, 
link table 696 of FIG. 20. Thus, if people enter an uncom- Determining Other Cbntent 

mon k<7word such as "cassetes" instead of "cassettes^ the 25 When searching on the Litcmet, various different web 
key-word suggester will suggest that tithe user tries «cas^ pages listmgs and web pages arc displayed as has been 
s^tes". Th<^ IS therefore, no need to create a set of described toe common chacter^ of each these different 
URLicyword bnks m Table 3 for "cassetcs" Thus saving on web page listings that have been described is that when diey 
daU q)^ and here IS also no need to said a tagged set of are delayed they appear substantially identical to one 
results for the keyword "cassetes". Hence there win be less 30 another As shown in HG. 25, eadi of the diffoent Kstmes 
data sent b«± to the search engine. 90O, though the text may be different, is othcrwfae visually 

It IS also a contemplated embodiment to run the keyword identicaL Other listings 902, however arc many times larger 
suggcstca- mce Tibk 3 and have high flying keyword asso- than the Hstiogs 900, may indude gr^hical content, and 
ciations and new keyword assoctatk>ns so the system can appear more promment when displayed to the user Such 
learn how keyword associations change over time. For js listings can contain the same content as a web page listing, 
example, the keyword suggester trace wan store the most or other content, sudi as advertisements, pictures, editorials 
recent keyword links and modify the main key-word the by and the like, 

ahistoryfactor,intiesamcwayasTM>Ie3ismodifiedbythe This other content maybe displayed to a particular user 
cumulative Kirfer trace. based upon key-words, user profile type (nationafity, age, 

^TliecumulaUve keyword surfer trace 698 is processed in 40 gender, occupation, and so forth) and the time of the day for 
the same way as the cumulative web^age surfer trace 170 example. 

of FIG. 5 to reinforce links between keywords in the In many instances, this conteiit that is displayed atong 
facyword link table 696 (Table 10). A time variable can also with web page listiiigs is inserted into the display area using 
be inchided so that if a user chooses another keyword very mechanisms that are different fiom the searching system 
quiddy It IS assumed that the previous keyword was rot 45 described prcviousiy with respect to conventional search 
useful arid IS not counted as a keyword surfer trace. ermines. The mechanism by ¥^di this content is disajlayed 

Also, the mdividual keyword suggester cati store, for each in large measure based upon some other criteria, such as 
user, Iheff personal keyword links. Further, the keyword payment for the ^ace that is used. While the system for 
suggester can be based on a number of different profile selecting this content worics, it is difficult to keep track of 
types. The word associations may be quite different fi>r 50 whidicontcntwasdi^layed when, e^)ecially if that cwilenl 
people of different culture, oalbnality, occupation and age is frequently changed Thus, another aspect of the present 
etc. Different keyword sugg^crs wan capture the keyword invention, which wiU now be d^cussed is a system for 
assooalbn of different groups of people; The keyword hits traddng changmg content, and aUowing (or content provid- 
m Table 10 can be subscnpled in die same way diat the ers to dynamically select when their content wiD be dis- 
values of X, Y and Z are subscripted for (Afferent types of 55 played. 

profiles in Table 3, as explained previously. Usmg the Tables This dynamic selectable content as illustrated in FIG 22 
^ S?*^* suggested keywords maybe displayed to the viewer based upon keyword o^ 

HG. 21 inustrates a variety of manners in which a list of profile type as entered by the viewer in step 762 as shown 
suggested keywords can be created. Within the content selector step 764 that then foUows, the 

One maMerfi by ranking the vahies of X in the key^ 60 time of the day is considered and used in selecting the 
hnk table 696 (Table 10). This ranked list of keywords is appropriate content 9«2 as illuslrated iq FIG. 25 along with 
combined with keywords from a normal search of keywords, the web page listings $W. Each content 902 transmitted with 
described previously with respect to step 646 of HG. 19. the search results made up of web page listings 900 is tagged 

Anodier manner of suggesting keywords, shown as step in step 766. Thus, if a user in step 768 selects that content 
730, IS to compare the popular list (URU X values) Or the « 902, the results of that selection is fed back to the content 
user-entered key-word with the popular-list of oUier key- selector 764 so that the content database associated 
words m Table 3. A similarily pattern X vahics in Table 3 therewith, can be updated as surfer trace dato in a manner 
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such as has been previously described. Iheieafter, in step In addition to Ihe key-woid dataset S04 it is possible to 
770» that content 902 is displayed, typically siandtaneously have a data set of the following type for different profile 
with content 900 types 806. It contains the same entries for each profile type. 
In addition to the surfer trace data beii^ input as has been instead of keyword as descr3>ed above with req[)ect to the 
previously described, this content embodiment also provides 5 keyword contmt data table 804 of FIG. 2i3. 
for the web page developer, or content provider, to deter- 
mine the fiiequency with which this content will be TABLE 13 
reviewed, and, depending upon the patterns of users with — — — — —— — 

reflect to web page listings that are viewed alter the maimer pcraomd profile cxmtent i^bte 
in v^iich the content provider's content 902 is displayed 

based upon key wcHds, user profile and the like. In Older to ^° ^"^^^ ^SST 

miplcmcnt this dynamic content flexibility, there arc three hits for sent der der 

additional data tables, illustrated in FIG. 23, v^iidi are used one out 1 2 ibtal 

to track the changing content 902. These tables arc keyword Profile type month (H> (ff) {m) (a2) (I) (f) 

content data table 804, personal profile content data table 

806; and content provider data table 81Z 1* 

Keyword content data table 804 is illustrated in more Professiona] etc 

detail in l^le 12 bekyw, aiod its cfaaracteristks are: Undefined profile 

His the cumulative number or hits for one time period for ' — ' ' ' 

^^^r^;^^^ Tablel3determinesthecontentvalueofthecontent902 

that Keywon^ ^ ^ specific andteooes of people as opposed to different 

N is the number of tinoes particular content 900 that is keywords and aHows fc»r targetirtg of specific audiences, 

associated with a keyword has been sent out for display. it is within the scope of the present invention to inchide 

This is not necessarily the same as H since contact combinatioa profile types in T^le 13 as well, such as male, 

associated with a proffle type may be have a differrat professional or New Zealand, females. The content value for 

selectioD factor than content associated with the key- ^ the oonibined profiles win be different am the content val^ 

word. Us selection &ctor can be various variables^ such of individual profiles. Ibe mechanics involved in determio- 

as votes or price; ing the contact value and choosing the content 902 will be 

A K the selection factor for the keyword from each ^ same, and described further hereioafier. 

content provider (e.g. a selection &ctor could be a $ bid Content proWder daU table 812 oi FIG. 23 is illustrated 

to be associated with that keyword); ^ in more detail below as lM>le 14 and contains information 

T is the total of the selection factors for each keyword and ^ content provider, such as name, address, 

: is the sum of A's; and advertiser, content infonnation such as the ]^tni^(inMLor 

P is the content vahie, as determined by votes or price, for ^ amilar) that the coritent 902 wiD use and a 

eadi keyword and is T/N(e.g.thfe could be the $ per 35 iinique number to identify eadi different item of content 902. 
time content is sept out with that key word — this is a 

price of being associated with that key word) TABLE 14 . 

Uiui{iie fmiwh gf 

TABLE 12 Name Address etc OtmteiA Infonnatum for csch Cbntcnt 



KcywDid content data set* ^ E^John Content, na 

Content, no. 



Amount of 

Qtnmlatnre Cbntcnt ' Content Qm tfatt 



iiitsforone tent out Provider 1 Pkovider 2 Ibtal This Table may also store details of the content jHovider, 
Kcywoid month <H) (pi) (Ai) (A2> (I) (F) such as passwordbs, payment details (e.g. credit card number 
3^ ^ and anthodzaticMiX content delivery (number of times oon- 
Fith tent has been sent to users) eto. Hie daU sets for the above 
• : mentioned content tables are populated as follows. For the 

...... . . 1. keyword content data table 804 

This Table can also mdude the maxmiumconteiit value M „' , , . , ^. , . 

that the content provider is prepared to give. Tbeie is no » cuinidativenumb«»of lutsfor apar1i«larkeyj^^ 

l>.mttolhe.nunirrofconlentpiovide.st]»tmaytfle.np^ tune penod. .s taken direcUy &om Table 1 



have content 902 di^layed with a web page listing that is 



associated with a particular keyword. N is the number of times content is sent out associated 

It is possible to have a separate Table 12 for eadi country with the keyword TTns is incremented each time aU 

or area, so that the content vahie per country or area, per 55 '^^.f "'"^"^ ?^ ^ displayed to a user that is 

keyword could be different In addition there couM be speaficafly associated with that keyword 810. 

different content values for different time periods in each values for A 802 are selected by content providers for 

country or area. keyword. Hie content provider can also enter a 

It is possible that provider's of content 902 couW target maximum vahie M over which they wiU no longer 

both the key-word and the audience by identify each of the 60 *® ^ ^ent out with the keyword, 

keywords with target audiences, e^. the number of hits T is the total for each keyword and is the sum of As 

associated with the word rugby could be broken down into P is the content value, as determined by votes or price, for 

the different profile type s that search for the word rugby. each keyword and is T/N 

The cumulative number of searches for rugby could be 6000 Populating the Personal Profile Content Data 

split into 520 under 21'$ and 4000 21-50 year olds and 520 6S H is the cumulative number of hits for each profile type 

50'age group. Tbus^ there may be a different content vahie and this information is taken directly from Table 1 (sum 

for each of these sub classes within a keyword search. of the indexed W's). 
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N is the number of items of coDtent 99Z sent out assod- of content 902 is displayed \^uld be incremented only for 

atcd with the personal profile. This is incremented each the ^mak under 21' Table (this: is the figure used to deter- 

time an item of content 902 is sient out that is ^jecifi- mine the vahie of the content per unit view, 

cally associated with that profile type 810 Determining Which Specific Content is Transmitted 

The values for A 80S are placed, through an entry process 5 The example below shows how content associated with 

alrin to biddings for each profile type. The content the keyword is selected. It is the same process for content 

provider can also eater a maxirouin M they are prepared associated with profile types, 
to pay, or vote, as the case may be. 

T is the total for each profile type, and is the sum of As. 

P is the content value for eadi profile and is T/N 10 
Populating the Content Provider's Details Table 

Ibe majority of the content provider's deals 812 are 
electronically entered by the content provideis. Bach time a 
content provider's content 9(XZ is sent out this event is also 
recorded in the content provider's details Table 8 12!. This 15 
will also record the nuniber of dick-throughs (820, 822, 824, 
. 826, 828) and the cost, in terms of payment or votes, of the 
content 902. This will form the basis of the clectnmicbiU or por the key-woid 'Hook* the content 902 of content 
tabulation that is thereafter forwarded to the content pro- provider 2 would be dsplayed whenever the keyword was 
yitfer. 20 searched, as they are the only content provider associated 
How the DaU Sets are Used to Select Content Sent Out to wit that key-word. However, for the key-woid "fish", con- 
Users tent provideis 1 and 2 would have their conteixt sent out the 

In the discussbn that folbws, with reference to FIG. 24, sanie number of times. In the system scaled to the levels at 

it is assumed that only one banner of content 902 is which it is intended to be used, them will be a very laige 

transmitted with each set of web page search results 900. 25 numberof content provideis Mdiing for different keyw^ 

The same olgorithm apply if there are nralt^le sets of and profile types, 

content transmitted with each set of web page results. Calculating the Value of Content 

A keyword and profile type are submitted to the search If there is a new content provider who, for the keyword 

engine in step 852. From Ixyword content data table 804, '^boc^" vahtes the content at, for instance, $5 per month, 

personal profile content data table 806, the value of content 30 This will change the value to OSHS and this win mean that 

902 for each is found &om the vahie of Pin the Tables. The the total assodaled with the word book is $15. Hierefore, 

lugbest value of P for the keyword or profile type, deter- content provKter 2 would now get transmitted 66% of the 

mined in step 8624 determines the type of content byword time (10/15) and the new content provider would be dis- 

or profile type) that is transmitted ^ng with the web page played 33% of the time. Ibe (vopoition of time an ccntent 

listings 900. It may be that there is no specific value for the 55 provider's content is transnutted is A/T. 

keyword and the user may not be using a specific profile How Content Provider's Use the Data I^bles 

type. In this case the values for unassigned content items When bidding for content 902, content provideis select a 

win be used (fiom Table 13 for users without a profile). keyword or profile to target their content from Tables 12 & 

Choosing which ^tecific content item 902 is sent out is 13. The search engine indicates autoroaticaDy the number of 

discussed bcUm, The details for the ccMitent item (their 40 times this search has been peiformed for the previous time 

giaphics, text, associated programs, etc) are taken torn period 01), the number of times items of content were sent 

l^le 14, content provider details table 814 and transmitted out associated w^h that selection ^ and the value of the 

to the user in step 868. Details of the content items 902 content P. 

: transmitted for ea^ content provido^ are also sent to the The new content provider then enteis the selection factor 

content provide^ as shown by step 870, at regular intervals. 45 A and the system can then instantly cakulale the new value 

Determining Whether it is Keyword or Profile Content That (P) based on the new total bids (I). The advertiser can also 

is Transmitted be told the number of views per month they are likely to get 

The type of content 902 transmitted is dependent upon for their bid (N*(A/1)). These changes are calculated in 

whether it is a key word based content or profile cation real-time to give the new content provider an indication of 

based content For example a Male £rom the US may search 50 how their bid wiH influence the value and the views they will 

for fish. The vahie applicable to this search is, keywordafish, receive for their bid. If a value and number of views are 

profile»male, profile-US, profile US, male. When deciding agreeable to the advertiser they can choose to submit it as a 

which content gets di^layed the system compares the value bid for the defined period, such as a day, week, or month, for 

ofthe content for all the possibilities (keyword combmations instance. The details of other content providers are, 

of profile types) and sends out the content that has the most 55 preferably, not made public. Omlent providers may also 

value as determined in st^ 862. For example an under 21 enter a maximum vahie M they can pant with for their 

male may search udng the key-word "Ru^y" and the value content This provides content provideis with some security 

for the associated content for Rugjby is 0.1 per view, whereas against paying too much if the vahie changes. If the value 

the value per view for targeting an under 21 male is 0.2 and goes loo high then a content provider's bid can drop off the 

thus the content taigeted at the male under 21 would be 60 list (if P is greater tfiah M then A is not counted as a bid for 

displayed rather the rugby content It is important to o<^e that particular content provider). The bid would go back on 

that the cumulative frequency of times that content items the list if the value went down again, thus acting as a 

SH)2 are transmitted (N) will be different to the total cumu- stabilizing mechanism. Tbe content provider can, in a pre- 

lative frequency for the targeted area (H). In this example the ferred embodiment, be notified by e-mail if their content 902 

cumulative frequency (H) of the number of times 'rugby' is 65 has dropped off the list due to their vahie limit M . 

searched for and 'males under 21' would both incremented As shown by tbe content provider details table 812 of 

by one (via Table 1). However, the number of times an item RG. 24, for instance, content providers thus have an account 
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with the search engine proimlois and procedures for deb- keyword. The default setting is that a hil on content counts 

iting their account for their content is automatically calcu- the same as a hit hoin the Est of web pages. The vahie of 

lated bom the account details on a periodic basis. An content hits can be set higher or lower than imity^dependiiig 

electronic statement of the number of views, cost per view, on the price of the content, e.g. the '^content hit £ictor^ may 

numberofclick-throughs and cost per cHck-througb for each 5 oeed to be increased for valuable keywords as this would 

content provider is also forwarded to each content provider, decrease the ability to spam these commercially valuable 

since this information is also stored in OHStent provider keywords. The higher the contept factor, the higher the 

details table 812 (Table 14). In a preferred embodiment, it is resistance to spam as the search results would be more 

possible to identify chisters of similar keywords based on dependent on price rather than popularity. 

tbekeyv«>rflinktabfc^ TTie time period for content biddii)g 

keywords isso that content m can be taijee^^ ^^^^ ^^^^ ^ certakalnoimt for a particular 

words rather than just m^^ ^ period e.g. one month. This time period may be 

key.word W may mcludehuii^ differ^depen^gon the rate^f-change of the price. If the 

that have hnks^to the word car (e.g^ convertibles chaS&nipidly or is very stable, the time period 

automobiles^ vans). Stat^^ ,5 P ^ respSi^ty^rtened or lengthened corre5)od- 

to defme the size aiid frequency of key-word clusters. This • i 

naakes it a mudi more automatic process the an editor *°&y- ... 

deciding on chisters of keywords for content provider's to 5?^^" ^L^^-^^P^i 

^ ^ This number could be changed to influence how the 

The same system can be used to set values for keyword „ sys^^ 

dusters. While groiqring words in this way would incur an Lei^ of time between accepting ncw.wd>-pagesubmis- 

tooeased administration cost, it is nevertheless computa- ^^'^ 

tionafly similar and oiily initiated once a certain level of hits If the date of submission for a web-pagp is too ckwe to the 

on a k^rword had been exceeded. existing submission for that web-page, then it is not 

Cbntent wily scardi Users can also purposely choose to ^ accepted. This length of time can be changed depcndmg on 

search only the content provider associated with a keyword. any of the above factors 

In this case the search results will be based on the vahies of Number of searches per day, per person (IP address or 

A in Table 12. TTie content providers that pay the most will user ID) that count as valid hits 

be at the top of the lisL This number can be change to reduce the possibility of 

Ihe key-word saggester can also be^ content piovideis q>amming 

choose key-'words or sets of key-words that they would like Len^ of time before renewing the security TM>le 

to display. security Table that restricts abuse, notes the links 

Cbntroiling the search engine system There are a number between keywords and IP addresses of us^ identificatioiK. 

of paraineters that can change the way in which the search The length of time between refreshing this T^ble can be 

engine according to the present invention ranks web pages. ^ dianged to make it harder to spam the system. 

These &ctois (described in detail below) are: The settings for these factois can be diffident for diffisient 

History factor keywords or gnmps of people depending on: 

lbs determines the rate of decay of the existing popular Frequency with which searches are done 

lists (popular hit list) as described in the text prevk>usly This The rate of-change of frequency of searches 

is a nun^r betweoi 1 and 0. Ahigh history &ctor will make ^ xhe price of the content 

it difiScult to change the existii^ popularity lists. As an The rate of change of price of content 

exanq>le if the rate of seardiiag for a particular keyword is -The piedse settmg of eadi of these factors will not be 

increasing quiddy, then the history factor ^ukl be bwer to known until the system begins operation 'learning' about the 

enable emerging web pages to rise iq» the popularity list. behaviors. The optimum settings for different situa- 

Frequency of updating Table 3 from the cumulative soifer 45 tions may be determined t>y experimentation, 

trace Other Applications 

Ibis is a measure of the frequency with which the Though the preferred embodiment has been described 

popularity lists are updated with information about the with reference to a software useable 00 a computer network 

users' activities (i.e. the surfer trace), for example, this may f^^f searching the Internet, it will t>e appreciated that the 

be measured once a day or even once a month depending on 50 iavmtion may be readily applied to any search system where 

the rate of change of popularity of particular ke3rword ^ human user diooses results from a set of initial search 

searchesw results. Such a system may for example be part of an, a LAN 

Sampling Frequency or WAN or even a database on an individual PC 

This is the frequency of sampling the infomaation of how Examples of other possible areas of application for the 

users are searching. If it is a common keyword it is not 55 present invention are described below, 

necessary to monitor every search. It may be that <Hily a Intranet Searches and Other Data Base Searches 

percentage of all seardies need be monitored to accurately Intranet searches at present suffer from similar drawbacks 

determine web-page popularity. firom Internet searches^ indeed some intranets can in them- 

The composition of the default search Est (mix of results selves be extremely substantial systems^ in which identify- 

from the new web-page list, high-flyeis and popular- 60 ing a particular information source or item can be equally 

lists etc.) problematic. Utilizing the present invention in such appH- 

The mix of web pages presented to the user as a default cations is within the intended scope of the present invention, 

can be chained if necessary to reflect the way in which Searching Other Media Forms 

search results evolve over time. The present invention is also intended to be applied 

Content * hit factor ' 65 matching a user's profile to other media sources (such as pay 

The "content hit factor" is a measure of the weighting pcr-vicw, television, videos, music and the like)^ thus allow- 

given to a hit 00 content being recorded as a hit for a ing content targeted to a particular audience. The same form 
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of search lists as described above (Popular-Iist» High-flyers, Internel could be more taigeted to differeot audiences. This 

Hot-off the press etc) may be employed to direct useis to profile type inforaiatioD could ako be used by wd>-page 

appropiiate material developers to customize their web-page for difFcrcnt sets of 

Shopping users. 

llie search techniques described herein can t>e imple- 5 People Matching Senrke 

mented in a consumer network to assist shoppers in select In another embodiment* the system according to the 

items from within one shop or among a Urge rramber of present invention can be used as a dating service andA>r a 

shops Instead of using a keyword-URL link Table, there method for matching people with similar inferences by 

would be used a keyword-item purchased link Table, that doing a statistical analysis to compare the individual pref- 
tbenrecordswhat-items were purchased after each shopping lO erences (Table 6) of groups of users. The individual past 

request (key-word). This embodiment also records where preference Tables* in this embodiment, would preferably be 

the user purchased the product Each tirne a shopper pur- normalized and compared to each other using a standard 

chased an item this would incremoit the popularity of that conelatioa coefGcient When compared to other users it 

item, using the same techniques described previously. would give a numerical indication of how similar their 

The profile type s in this embodiment can be i:^ed to 15 preferences are. 

record the types of purchases made by different sets of The same embodiment could also be used to Gnd infor- 

people. Obccould, for example, select a profile type axKl see mation about shoilar people from there past preferences 

what are the most commonly purchased items for a rar^e of Ibbksw For example one couM ask to be give the names of 

users* and would provide assistance in dKX)sing gifts for people in New Zealand with an interest in Ecological 

people who have a different profile type than yourselL 20 Economics and a seardi could be made of the personal 

Sciertfific Publkations preferences Tables. Such an embodiment, however, would 

Searyinr^ scientific daU bases (on-line papers* journals, typically include a passwonVconsent indicator that provides 

etc.) with the present invention will dramatically reduce the consent of identified persons to give out their infonnation* 

time ^nt examining obscure, or esoteric areas only to find wliicfa consent could be given, for example* in only certain 

the infonnation irrelevant e criteria for a valid hit for such 25 circuiristarices, which circurnstarKses are lirriited to seardieis 

uses would typically incorporate the extended time feature who have a level of authority and password indicating the 

described above to establish the usefulne^ of the informa- same, or for persons who identity themscilves with certain 

tioio source. The refereeing and referencing of academic/ cdiaracteristics. 

scientific papers u^ng the present invention could enhanced Wfaik the invention has been described in connection 

by dassifying difTcrent levek or types of user* e.g. Dr* ao with what is presenUy considered to be the most practical 

Professor etc. postgraduate* and so forth. This wiSi enable and preferred eiribodiments* it is understood that the inveii- 

users to see. for example* what infimnation sources the tion is not limited to the disdosed embodiment For example, 

eminent authorities in a particular field found of interest. It each of the features described above can be use singly or iii 

would also allows the authors of a p^r to become aware of combination* as set forth below in the claims* without other 

how ofien their publication was accessed and possibly 15 features described above which are patentably significant by 

further indicate where and how ofieri the paper was used as themselves. Accordingly, the present invention is intended to 

a reference in subsequent papers^ Users may have to for- cover various modifications and equivalent arrarigements 

roally register with differentorganizations to obtain levels of included within die spirit and scope of the appended claims, 

ability to referee. Users may also choose the level of We daim: 

refoeeing for their searching. 40 1. In a computer network having a plurality of user sites, 

Onlirie He^ a method of weighting the relative importance of a plurality 

Ibere is currently a substantial global requirement for of data items stored in a database 00 a server computer 

online help and support particulariy for computer/software comprising the steps ot 

applications. Sudi a need would be considerably assuaged rccchrmg at said server computer one w more keywords 

by use of the present invention as the software developers 45 fiom user site^ 

obtain a direct feedback to the type and fircquency of _ • ^ ±. i- . 

particular inquiries* whilst the uses i^ieive the^SuinXted ser^r computer a ^ty of hstmgs 

benefit of the previous users. Different profile type s would "^^"^^ keyword, eadi hstmg also cor- 

enabk the answers to be provided in ao appropriate form for to one of said data items; 

the user, e.g. novice* c^rt, etc. The keyword suggestcr so transmitting from said server computer to one of said user 

may* for example* suggest searching with key-words sites said plurality of listings 

(questions) more l^ely to yield a satisfactory refuse. detecting at said server computer winch ones of said 

There can be a range of answers to each question and as the plurality of data items are selected by said user site* 

system learns it will converge on to the best answers. site being transmitted each selected one of 

Question and Answer Services 55 said data items upon selection of said correqx>ndirig 

Cbxrent On-line question/answer programs could be con- listing by said user site; 

figured to run via the present invention thus enabling updatingsaid database to weight said selected ones of said 

answers to repeatedly a^ed questions to be based on daU items as relatively more important than unselected 

previous questions and wmilar questions to be suggested. ones of saki data items with respect to said keyword; 

Content Optimization on Other Parts of the Inteniet 60 wherein said step of detecting only detects eadi key- 

The same content bidding mechanism could be used to word one time finom each user site during a deter- 

detennine the price of content for aiiy location on the mined interval of time; arxi 

Internet* not just web page listings as identified above. In wherein said step of generating uses a history factor 

this embodiment* content providers will bid for a general assodated with each keyword in determining said 

content space to set the price automatically. 65 phirality of listings, wherein said history factor is a 

The profile type information from the search engine could number less than or equal to 1 and greater than or 

be used as a pas^rt so that other advertisements on the equal to 0. 
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2. In a computer network having a plurality of user sites 
and developer sites, a method of populating a database on a 
server computer oomprisii]^ the steps ot 

entering a phiraHty of data items into said database from 
said developer sites, each of said data items entered into 
said database including as associated identifiers a phi- 
raHty of associated keywords; 
updating said database by entering a plurality of user 
traces, each of said user traces identifying one of said 
daU items and an associated keyword so that each trace 
ipcieascs die relative inaportanoe of the associated data 
item with respect to said associated keyword, wherein 
said step of iq[>dating only li^dates each keyword one 
tinie for each user during a determined interval of time; 
and 

wherein said step of ii^xlating each keyword one time 
indodes the steps of: 



10 



15 
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associatii^ an identifier with each user, \^rein each of 
said user traces including the user identifier, wherein 
said user identifier is used to iqxlate a user table; and 

using said identifier to track keywords that have been 
entered by each usen 

3. Amethod according to daim 2, wherein said user table 
includes liser traces corresponding to a plurality of different 
types of searches performed by the user. 

4. A method according to daim 3 wherein one of the types 
of searches performed by the user oorrc^nds to keyword 
searches that have been most recently updated. 

5. A method according claim 2 wherein a particular user 
has a plurality of user identifiers, each user identifier cor- 
responding to a different user piofife. 



